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NOVEL FDRG PROTEIN AND NUCLEIC ACID MOLECULES 
AND USES THEREFOR 

Government Funding 

5 This invention was made in part with government support under USPHS Grant 

DK3 1405. awarded by the National Institutes of Health. The government may have 
certain rights in this invention. :* ■ 

Background of the Invention 

10 Blood vessel formationoccurs by two processes, vasculogenesis and 

angiogenesis. Vasculogenesis involves establishment of a primitive; vascular network 
during embryogenesis. Subsequently, angiogenesis remodels this network to form the 
mature cardiovascular system. In addition, angiogenesis takes place in adult tissues 
during tissue repair, the reshaping of female reproductive tissues and tumor growth and 

1 5 metastasis. (Maisonpierre et al ( 1997) Science 277:55-60). 

Endothelial cells have been found to be centrally involved in both vasculogenesis 
and angiogenesis. These cells migrate, proliferate and assemble to form tubes with tight 
cell to cell connections. Peri-endothelial support cells are then recruited to encase the 
endothelial tube to thereby provide maintenance and modulary functions to the blood 

20 vessels. (Hanahan (1997) Science 277:48-50). 

The process of establishing arid remodeling of blood vessels is controlled by the 
interaction of several protein ligands and tyrosine kinase-like receptors expressed on 
endothelial cells. Such receptors include, for example, the recently discovered TIE 
receptor family consisting of TIE1 and TIE2 which have been found to be critically 

25 involved in the formation of vasculature. Dumont et al (1994) Genes Dev. 8:1 897- 
1909; Sato et al (1995) Nature 376:70-74: Moreover, ligands which bind to the TIE2 
receptor and belong to the protein family which includes fibrinogen and ficolin have 
been discovered. These ligands include ahgiopoietin-1 which binds and induces 
phosphorylation of TIE2 (Davis et al (1996) Cell 87:1 16M 169) and angiopoietin-2 

30 which acts as a natural antagonist for angiopoietin-1 and HE2 (Maisonpierre (1997) 
Science 277:55-60). Both of these ligands, angiopoietin-l and angippoietin-2, have been 
implicated in blood vessel formation and remodeling. 
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Recently, an interesting connection was made between the divergent fields of 
angiogenesis and obesity. In particular, it was demonstrated that leptin, a protein 
secreted by fat cells, functions as both a metabolic hormone and a bona fide angiogenic 
factor (Sierra-Honigmann etal. (1998) Science 281:1683-1686). The finding that a 

5 factor secreted by fat cells having a known role in metabolism and obesity also plays a 
role in angiogenesis demonstrates a novel link between adipocytes and the vasculature. 
The complex functionality of leptin raises the possibility that there exist other molecules 
which may play a dual role in both angiogenesis and obesity: Accordingly, there exists 
a need to identify and characterize such molecules and to understand me mechanism by 

1 0 which organisms utilize one factor in controlling multiple cellular functions. Moreover, 
there exists a need for identifying molecules with which such dual regulators interact as 
well as identifying modulators of such dual regulators. 

Summary of the Invention * 

15 The present invention is based, at least in part, on the discovery of novel secreted 

molecules having homology to angiopoietin which are believed to modulate 
angiogenesis and mediate the contribution of adipocytes to various metabolic processes. 
These novel molecules are referred to herein as Fibrinogen Domain Related (referred to 
herein as "FDRG" or TANGO 115) nucleic acid and protein molecules and are useful as 

20 modulating agents in regulating a variety of cellular processes {e.g., angiogenesis-related 
' and metabolism-related processes). Accordingly, in one aspect, this invention provides 
isolated nucleic acid molecules encoding FDRG proteins or biologically active portions 
thereof, as well as nucleic acid fragments suitable as primers or hybridization probes for 

the detection of FDRG-encoding nucleic acids, s 
25 in one embodiment, a FDRG nucleic acid molecule is 60% homologous to the 

nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID 
NO:l 1, SEQ ID NO: 13, SEQ ID NO:14, the nucleotide sequence of the DNA insert of 
the plasmid deposited with ATCC as Accession Number 98632, or a complement 
thereof. In another embodiment, an isolated FDRG nucleic acid molecule encodes the 
30 amino acid sequence of human FDRG shown in SEO/lD NO:2. In another embodiment, 
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an isolated FDRG nucleic acid molecule encodes the amino acid sequence of murine 
FDRG shown in SEQ ID NO: 12. 

In a preferred embodiment, an isolated FDRG nucleic acid molecule has the 
nucleotide sequence of SEQ ID NO: 1 , or a complement thereof. In another 
5 embodiment, a FDRG nucleic acid molecule further comprises nucleotides 1 - 1 65 of 
SEQ ID NO:l. In another embodiment, a FDRG nucleic acid molecule further comprises 
nucleotides 1 384-1 894 of SEQ ID NO: 1 . In yet another embodiment, a FDRG nucleic 
acid molecule comprises the nucleotide sequence of SEQ ID NO:3. 

In another preferred embodiment, an isolated FDRG nucleic acid molecule has 

1 0 the nucleotide sequence of SEQ ID NO: 1 1 , or a complement thereof. In another 
embodiment, a FDRG nucleic acid molecule further comprises nucleotides 1-148 of 
SEQ ID NO: 1 1 . In yet another embodiment, a FDRG nucleic acid molecule further 
comprises nucleotides 1379-1 893 of SEQ ID NO : 1 1 . In yet another embodiment, a 
FDRG nucleic acid molecule comprises the nucleotide sequence of SEQ ID NO: 13. 

1 5 In another preferred embodiment of the invention, an isolated FDRG nucleic acid 

molecule has the nucleotide sequence of SEQ ID NO:3, SEQ IDNO:4, SEQ ID NO:13, 
SEQ ID NO: 14 or a complement thereof. In yet another embodiment, a FDRG nucleic 
acid molecule comprises nucleotides 1 -556 of SEQ ID NO:3 or SEQ ID NO: 1 3 or 
nucleotides 1-481 of SEQ ID NO:4 or SEQ ID NO:14. 

20 In yet another preferred embodiment, an isolated FDRG nucleic acid molecule 

has the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98632, or a complement thereof. 

In another embodiment, a FDRG nucleic acid molecule includes a nucleotide 
sequence encoding a protein having an amino acid sequence sufficiently homologous to • 

25 the amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO: 12, or SEQ ID 
NO: 1 5. In yet another embodiment, a FDRG nucleic acid molecule includes a 
nucleotide sequence encoding a protein having an amino acid sequence at least 60% 
homologous to the amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID 
NO: 12, or SEQ ID NO: 15. In a preferred embodiment, a FDRG nucleic acid molecule 

30 includes a nucleotide sequence encoding a protein having the amino acid sequence of 
SEQ ID NO: 2, SEQ ID NO:5, SEQ ID NO:12, or SEQ ID NO:15. 
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In another embodiment, an isolated nucleic acid molecule of the present 
invention encodes a FDRG protein which includes a signal sequence, an N-terminal 
unique domain, a C-terminal fibrinogen-like domain, and is secreted. In another 
embodiment, the FDRG nucleic acid molecule encodes a FDRG protein and is a 

5 naturally occurring nucleotide sequence. In yet another embodiment, an isolated nucleic 
acid molecule of the present invention encodes a FDRG protein and comprises a 
nucleotide sequence which hybridizes under stringent hybridization conditions to a 
nucleic acid molecule having the nucleotide sequence of SEQ ID NO:l , SEQ ID NO:3, 

, SEQIDNO:4,SEQIDNO:ll,SEQIDNO:13,SEQIDNO:14,oroftheDNAinsertof 

10 the plasmid deposited with ATCC as Accession Number 98632. 

. Another embodiment of the invention features FDRG nucleic acid molecules 
which specifically detect FDRG nucleic acid molecules relative to nucleic acid 
molecules encoding non-FDRG proteins. For example, in one embodiment, a FDRG 
nucleic acid molecule hybridizes under stringent conditions to a nucleic acid molecule 
15 comprising nucleotides 1-721 of SEQ ID NO: 1 or SEQ ID NO: 11. In another 
exemplary embodiment, a FDRG nucleic acid molecule hybridizes under stringent 
conditions to a nucleic acid molecule comprising nucleotides 1-556 of SEQ ID NO:3 or 
SEQ ID NO:13. In another embodiment, a FDRG nucleic acid molecule hybridizes 
under stringent conditions to a nucleic acid molecule comprising nucleotides 1-481 of 
20 SEQ ID NO:4 or SEQ ID NO:14. In another embodiment, the FDRG nucleic acid 
molecule is at least 500 to 550 nucleotides in length, 600 to 650 nucleotides in length, 
700 to 750 nucleotides in length, 800 to 850 nucleotides in length and hybridizes under 
stringent conditions to a nucleic acid molecule comprising the nucleotide sequence 
, shown in SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:l 1, SEQ ID NO:13, 
25 SEQ ID NO:14, the nucleotide sequence of the DNA insert of the plasmid deposited 
with ATCC as Accession Number 98632, or a complement thereof 

Another embodiment the invention provides an isolated nucleic acid molecule 
.« which is antisense to the coding strand of a FDRG nucleic acid. 

Another aspect of the invention provides a vector comprising a FDRG nucleic 
30 acid molecule. In certain embodiments, the vector is a recombinant expression vector. 
In another embodiment the invention provides a host cell containing a vector of the 
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invention. The invention also provides a method for producing FDRG protein by 
culturing in a suitable medium, a host cell of the invention containing a recombinant 
expression vector such that FDRG protein is produced. 

Another aspect of this invention features isolated or recombinant FDRG proteins 
5 and polypeptides. In one embodiment, an isolated FDRG protein has a signal sequence, 
ah N-terminal unique domain, a C-terminal fibrinogen-like domain, and is secreted. In 
another embodiment, an isolated FDRG protein has an N-terminal unique domain and a 
C-terminal fibrinogen-like domain. 

In another embodiment, an isolated FDRG protein has an amino acid sequence 

1 0 sufficiently homologous to the amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, 
SEQ ID NO: 12 or SEQ ID NO: 15. In a preferred embodiment, a FDRG protein has an 
amino acid sequence at least about 60% homologous to the amino acid sequence of SEQ 
ID NO:2, SEQ ID NO:5, SEQ ID NO:12 or SEQ ID NO:15. In another embodiment, a 
FDRG protein has the amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID 

15 NO:12orSEQIDNO:15. 

Another embodiment of the invention features an isolated FDRG protein which 
is encoded by a nucleic acid molecule having a nucleotide sequence at least about 60% 
homologous to a nucleotide sequence of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, 
SEQ ID NO: 1 1, SEQ ID NO: 13, SEQ ID NO: 14, or the nucleotide sequence of the DNA 

20 insert of the plasmid deposited with ATCC as Accession Number 98632, or a 

complement thereof. This invention also features an isolated FDRG protein which is 
encoded by a nucleic acid molecule having a nucleotide sequence which hybridizes 
under stringent hybridization conditions to a nucleic acid molecule comprising the 
nucleotide sequence of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:l 1, 

25 SEQ ID NO:13, SEQ ID NO:14, the nucleotide sequence of the DNA insert of the 
plasmid deposited with ATCC as Accession Number 98632, or a complement thereof. 
In another embodiment, the polypeptide is fragment of a FDRG polypeptide comprising 
the amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:12, SEQ ID 
NO: 1 5, or an amino acid sequence encoded by the DNA insert of the plasmid deposited 

30 with ATCC as Accession Number 98632, where the fragment comprises at least 1 5 
contiguous amino acids of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:12, SEQ ID 
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NO: 1 5, or the amino acid sequence encoded by the DNA insert of the plasmid deposited 
with ATCC as Accession Number 98632 - 

The FDRG proteins of the present invention, or biologically active portions 
.thereof, can be operatively linked to a non-FDRG polypeptide to form FDRG fusion 

5 proteins. The invention further features antibodies that specifically bind FDRG proteins, 
such as monoclonal or polyclonal antibodies. In addition, the FDRG proteins or 
biologically active portions thereof can be incorporated into pharmaceutical 
compositions, which optionally include pharmaceutically acceptable carriers. 

In another aspect, the present invention provides a method for detecting FDRG 

10 expression in a biological sample by contacting the biological sample with an agent 
capable of detecting a FDRG nucleic acid molecule, protein or polypeptide such that the 
presence of FDRG nucleic acid molecule, protein or polypeptide; is detected in the 

biological sample. ■ \.' 

In another aspect, the present invention provides a method for detecting the 
15 presence of FDRG activity in a biological sample by contacting the biological sample 

with an agent capable of detecting an indicator of FDRG activity such that the presence 

of FDRG activity is detected in the biological sample. 

In another aspect, the invention provides a method for modulating FDRG activity 

comprising contacting the cell with an agent that modulates FDRG activity such that 
20 FDRG activity in the cell is modulated. In one embodiment, the agent inhibits FDRG 

activity. In another embodiment, the agent stimulates FDRG activity. In one 

embodiment, me agent is an antibody mat specifically binds to FDRG protein. In 

another embodiment, the agent modulates expression of FDRG by. modulating 
transcription of a FDRG gene or translation of a FDRG mRN A. In yet another 
25 embodiment, the agent is a nucleic acid molecule having a nucleotide sequence that is 
antisense to the coding strand of the FDRG mRN A or the FDRG gene. 

In one embodiment, me methods of the present invention are used to treat a 

subject having a disorder characterized by aberrant FDRG protein or nucleic acid 
expression or activity by administering an agent which is a FDRG modulator to the 
30 subject. In one embodiment, the FDRG modulator is a FDRG protein. In another 
embodiment, the FDRG modulator is a FDRG nucleic acid molecule. In yet another 
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embodiment, the FDRG modulator is a peptide, peptidomimetic, or other small 
molecule. In a preferred embodiment, the disorder characterized by aberrant FDRG 
protein or nucleic acid expression is a hematopoietic disorder, a differentiative or 
developmental disorder, a proliferative disorder or a cell recruitment disorder. 
5 The present invention also provides a diagnostic assay for identifying the 

presence or absence of a genetic alteration characterized by at least one of (i) aberrant 
modification or mutation of a gene encoding a FDRG protein; (ii) mis-regulation of said 
gene; and (iii) aberrant post-translational modification of a FDRG protein, wherein a 
wild-type form of said gene encodes an protein with a FDRG activity. 
1 0 In another aspect the invention provides a method for identifying a compound 

that binds to or modulates the activity of a FDRG protein, by providing an indicator 
composition comprising a FDRG protein having FDRG activity, contacting the indicator 
composition with a test compound, and determining' the effect of the test .cpmpound on 
FDRG activity in the indicator composition to identify a compound that modulates the 
15 activity of a FDRG protein. . 

In another aspect, the invention provides a method of treating obesity or diabetes 
in a subject which involves administering to the subject an agent (e.g., a small molecule 
modulator of FDRG, a FDRG nucleic acid molecule, or a FDRG antibody). In yet 
another aspect, the invention provides a method for diagnosing obesity or diabetes in a 
20 subject utilizing FDRG nucleic acid molecles. In yet another aspect, the invention 
provides a method for identifying compounds which modulate metabolic disorders 
utilizing cells which expresses a FDRG receptor. In yet another aspect, the invention 
provides a method of regulating angiogenesis in a subject. . ' < 

Other features and advantages of the invention will be apparent from the 
25 following detailed description and claims. 



Brief Description of the Drawings 

Figure 1 depicts the cDNA sequence and predicted amino acid sequences of 
human FDRG (also referred to as "hFDRG" or "hTl 15"). The nucleotide sequence 
30 corresponds to nucleic acids 1 to 1 894 of SEQ ID NO: 1 . The amino acid sequence 
corresponds to amino acids 1 to 406 of SEQ ID NO:2. 
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Figure 2, depicts the cDNA sequence and predicted amino add sequence of 
mature human FDRG. The nucleotide sequence corresponds to nucleic acids 1 to 1 143 
of SEQ ID NO:4. The amino acid sequence corresponds to amino acids 1 to 381 of SEQ 

IDNO:5. : ; "V 

Figure 3 depicts a multiple sequence alignment of the amino acid sequences of 
human FDRG (corresponding to amino acids 1 to 406 of SEQ ID NO:2), murine FDRG 
(corresponding to amino acids 1 to 41 0 of SEQ ID NO: 12), human angiopoietin 2 
(Genbank™ Accession No. AF004327) (SEQ IDNO:8), murine angiopoietin 2 
(Genbank™ Accession No. AF004326) (SEQ ID NO:9), and porcine ficolin (Genbank™ 
Accession No. L12345) (SEQ ID NO: 10). The alignment, was generated using 
MegAlign™ sequence alignment software. The initial pairwise alignment step was 
performed using a Lipman Pearson algorithm with a K-tuple of 1, a GAP penalty of 3, a 
window of 5, and diagonals saved set to 5, The multiple alignment step was performed 
using the Clustal algorithm with a PAM 250 residue weight Table, a GAP penalty of 10, 
and a GAP length penalty of 10. • 

Figure 4 depicts the cDNA sequence and predicted amino acid sequences of 
murine FDRG (also referred to as "mFDRG", "mTl 1 5", or BK89). The nucleotide 
sequence corresponds to nucleic acids 1 to 1893 of SEQ ID NO:l 1. The amino acid 
sequence corresponds to amino acids 1 to 410 of SEQ ID.NO: 12. 

Figure 5 depicts the cDNA sequence and predicted amino acid sequence of 
mature murine FDRG. The nucleotide sequence corresponds to nucleic acids 1 to 1161 
ofSEQ IDNO:14. The amino acid sequence corresponds to amino acids 1 to 387 of 
SEQIDNO:15. 

Figure 6 depicts a pairwise alignment of human and murine FDRG nucleotide • 
sequences (corresponding to SEQ ID NO: 1 and SEQ ID NO:l 1, respectively). The 
alignment was generated using the ALIGN program, version 2.0 (Myers and Miller 
(1989) CABIOS), scoring matrix = PAM1 20, gap penalties: -12/-4. 

Figure 7 depicts a pairwise alignment of human and murine FRDG amino acid 
sequences (corresponding to SEQ ID Np:2 and SEQ IDNO:12, respectively). The . 
alignment was generated as described in the legend of Fig. 6. 
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Detailcd Description of the Invention 

The present invention is based on the discovery of novel molecules having 
homology to the protein family that includes fibrinogen, angiopoietin(s), and ficolin, 
referred to herein as FDRG protein and nucleic acid molecules, which comprise a family 
5 of molecules having certain conserved structural and functional features. The nucleotide 
sequence of a human FDRG nucleic acid molecule and the amino acid sequence of the 
predicted human FDRG protein molecule are depicted in Figure 1. The nucleotide 
sequence of a murine FDRG nucleic acid molecule and the amino acid sequence of the 
predicted murine FDRG protein molecule are depicted in Figure 4. 

10 The term "family" when referring to the protein and nucleic acid molecules of 

the invention is intended to mean two or more protein or nucleic acid molecules having 
a common structural domain and having sufficient amino acid or nucleotide sequence 
homology as defined herein. Such family members can be naturally occurring and can 
be from either the same or different species. For example, a family can contain a first 

1 5 protein of human origin and a homologue of that protein of murine origin, as well as a 
second, distinct protein of human origin and a murine homologue of that protein. 
Members of a family may also have common functional characteristics. An exemplary 
family of the present invention is the fibrinogen-like protein family whose members 
include, at least, human angiopoietin 1 , human angiopoietin 2, ficolin and fibrinogen. 

20 Another exemplary family of the present invention is the FDRG protein family. 

In one embodiment, a FDRG family member is identified based on the presence 
of a "C-terminal fibrinogen-like domain" in the protein or corresponding nucleic acid 
molecule. As used herein, the term "C-terminal fibrinogen-like domain" includes a 
protein domain which is at least about 200-225 amino acid residues in length and has at 

25 least about 38-41% homology with human fibrinogen beta or gamma chain. In another 
embodiment, a "C-terminal fibrinogen-like domain" is at least about 100-300 amino acid 
residues in length, and preferably at least about 150-250 amino acid residues in length, 
and has at least about 25-60%, preferably at least about 30-50%, and more preferably at 
least about 35-45% homology with human fibrinogen beta or gamma chain. In yet 1 

30 another embodiment, a "C-terminal fibrinogen-like domain" refers to a protein domain , 
which is at least about 1 00-300 amino acid residues in length, preferably at least about 
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1 50-250 amino acid residues in length, and more preferably at least about 200 amino 
acid residues in length, and has at least about 25-60%,. preferably at least about 30-50%, 
more preferably at least about 35-45% homology, and even more preferably at least 
about 38-41% homology with amino acidresidues 184-400 of SEQ IDNO:2. In a 

5 preferred embodiment, a C-terminal fibrinogen domain is involved in protein-protein 
interactions. , v ' • 

To identify the presence of a "C-terminal fibrinogen-like domain" in a FDRG 
family member, the amino acid sequence of the protein family member can be searched 
v against a database of HMMs {e.g., the Pfam database, release 3.3) e.g., using the default 

10 parameters. For example, the search can be performed using the hmmsf program 

(family specific) and threshold score of 1 5 for determining a hit. hmmsf is available as 
part of the HMMER package of search programs (HMMER 2.1.1, Dec. 1998) which is 
freely distributed by the Washington University school of medicine. In one 
. embodiment, a hit to a fibrinogenC (fibrinogen beta and gamma chains, C-terminal 

15 globular domain) HMM having a score of at least 150-200, preferably at least 200-250, 
more preferably at least 250-300, and more preferably at least 300-350 or more is 
determinative of the presence of a C-terminal fibrinogen-like domain within a query 
protein. A search using the amino acid sequence of SEQ ID NO:2 was performed 
against the HMM database resulting in the identification of a C-terminal fibrinogen-like 

20 domain in the amino acid sequence of SEQ ID NO:2. Accordingly, in one embodiment 
of the invention, a FDRG protein has a C-teiminal fibrinogen-like domain at about 
amino acids 1 84-400 of SEQ ID NO:2. (Score of 1 93 against the fibrinogenC domain 
profile HMM Accession No. PF00147). In another embodiment, a FDRG protein has a 
V C-termmal fibrinogen-like domain domain at about amino acids 188-404 of SEQ ID 

25 NO:2. (Score of xxx.x). The C-terminal fibrinogen-like domains of human and mouse 
FDRG, as well as those of human angiopoietin, mouse angoipoietin-2, and ficolin are 
indicated by bold italics in Figure 3. . 

A C-terminal fibrinogen-like domain can further contain at. ieast about two, more 
preferably at least three, and even more preferably at least four cysteine residues which 

30 are conserved between FDRG molecules and members of the fibrinogen-like protein 
family {e.g., human angiopoietin 2, murine angiopoietin 2, human angiopoietin 1 , and 
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ficolin.) Preferably, the C-terminal fibrinogen-like domain of FDRG has cysteine 
residues which are located in the same or similar positions as cysteine residues in a * 
fibrinogen-like protein family member. For example, when a FDRG protein of the 
invention is aligned with a FDRG family member or a fibrogen-like protein family 
5 member for purposes of comparison (see e.g., Fig. 3) preferred cysteine residues of the 
invention are those in which cysteine residues in the amino acid sequence of FDRG are 
located in the same or similar position as the cysteine residues in other FDRG family or 
fibrinogen-like family members. As an illustrative embodiment, Fig. 3 shows cysteine 
residues located in the same or similar positions of the human FDRG protein 

1 0 (corresponding to SEQ ID NO:2) and human angiopoietin-2 in the following locations: 
amino acid number 1 88 of human FDRG and amino acid number 284 of the human 
angiopoietin 2 protein; amino acid number 2 1 6 of human FDRG and amino acid number 
313 of the human angiopoietin 2 protein; amino acid number 341 of human FDRG and 
amino acid number 437 of the human angiopoietin 2 protein; and amino acid number 

15 354 of human FDRG and amino acid number 450 of the human angiopoietin 2 protein. 
Conserved cysteines are indicated by asterices in Fig. 3: 

Another embodiment of the invention features FDRG molecules which contain 
an N-terminal unique domain. The term "N-terminal unique domain" as used herein, 
includes a protein domain of a FDRG protein family member which includes amino acid 

20 residues N-terminal to the N-terminus of a C-terminal fibrinogen-like domain in the 
amino acid sequence of the FDRG protein, e.g., a protein domain which includes amino 
acid residues from the N-terminal amino acid residue of the amino acid sequence of the 
protein to the N-terminus of the C-terminal fibrinogen-like 4 domain. As used herein, an 
"N-terminal unique domain" includes a protein domain which is at least about 170-190 

25 amino acid residues in length and has at least about 75-85% homology with the amino 
acid sequence of a second FDRG family member. In another embodiment, an N- 
terminal unique domain is at least about 1 50-210 amino acid residues in length, 
preferably at least about 160-200 amino acid residues in length, and has at least about 
65-95%, preferably at least about 70-90% homology with the amino acid sequence of a 

30 second FDRG family member. As further defined herein, an N-teiminal unique domain 
of a FDRG protein family member, however, is not sufficiently homologous to the 
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amino acid sequence of a member of another protein family, such as an angippoietin 
protein family. For example, an N-terminal unique domain of human FDRG (containing 
about amino acids 1 -1 83 of SEQ ID NO:2 has at least about 80% homology to the N- 
terminal unique domain of murine FDRG (amino acid residues 1-187 of SEQ ID NO: 12) 
5 but has no significant homology to the amino acid sequence of human angiopoietin 2. 

As further defined herein, an N-terminal unique domain may further contain an 
"N-terminal signal sequence". As used herein, a "signal sequence" refers to a peptide 
containing about 25 amino acids which occurs at the extreme N-terminal end of 
secretory and integral membrane proteins and which contains large numbers of 
10 hydrophobic amino acid residues. For example, a signal sequence contains at least about 
1 5-35 amino acid residues,.preferably about 1 8-32 amino acid residues, more preferably 
about 20-30 amino acid residues, and more'preferably about 22-28 amino acid residues, 
and has at least about 40-70%, preferably about 50-65%, and more preferably about 55- 
60% hydrophobic amino acid residues (e.g. , Alanine, Valine, Leucine, Isoleucine, 
1 5 Phenylalanine, or Proline). Such a "signal sequence", also referred to in the art as a 
"signal peptide", serves to direct a protein containing. such a sequence to a lipid bilayer. 

In a preferred embodiment, a FDRG protein contains both an N-terminal unique 
domain and a C-terminal fibrinogen-like domain. In another preferred embodiment, a 
FDRG N-terminal unique domain further contains a signal sequence. In one exemplary 
20 embodiment; a FDRG protein contains an N-terminal unique domain comprising amino 
acids 1-183 of SEQ ID NO:2 which further contains a signal sequence at amino acids 1- 
25 of SEQ ID NO:2 . In yet another exemplary embodiment, a FDRG protein contains 
an N-terminal unique domain comprising amino acids 1-158 of SEQ ID NO:5. In 
another exemplary embodiment, a FDRG protein contains an N-terminal unique domain 
25 comprising amino acids 1-1 87 of SEQ ID NO: 12 which further contains a signal 

sequence at amino acids 1-23 of SEQ ID NO:12. In yet another exemplary embodiment, 
a FDRG .protein contains an N-terminal unique domain comprising aniino acids 1-162 of 
SEQ ID NO: 15. 

In yet another embodiment, a FDRG protein encodes a mature FDRG protein. 
30 As used herein, the term "mature FDRG protein" refers to a FDRG protein from which 
the signal peptide has been cleaved. In an exemplary embodiment, a mature FDRG 
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protein contains amino acid residues 1-381 of SEQ ID NO:5. In another exemplary 
embodiment, a mature FDRG protein contains amino acid residues 1-387 of SEQ ID 
NO: 15. In a preferred embodiment, a FDRG protein is a mature FDRG protein which 
comprises a C-terminal fibrinogen-like domain. 
5 Preferred FDRG molecules of the present invention have an amino acid sequence 

sufficiently homologous to the amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, 
SEQ ID NO: 1 2 or SEQ ID NO: 15. As usedherein, the term "sufficiently homologous" 
refers to a first amino acid or nucleotide sequence which contains a sufficient or 
minimum number of identical or equivalent (e.g., an amino acid residue which has a 

10 similar side chain) amino acid residues or nucleotides to a second amino acid or , . 
nucleotide sequence such that the first and second amino acid or nucleotide sequences 
share common structural domains and/or a common functional activity. For example, 
amino acid or nucleotide sequences which share common structural domains have at 
least about 60-65% homology, preferably* 70-75% homology, more preferably 80-85%, 

1 5 and more preferably 90-95% homology across the amino acid sequences of the domains 
and contain at least one, preferably two, and more preferably three or four structural 
domains, are defined herein as sufficiently homologous. Furthermore, amino acid or 
nucleotide sequences that share at least 60-65%, preferably 70-75%, more preferably 80- 
85%, and even more preferably 90-95% homology and share a common functional 

20 activity are defined herein as sufficiently homologous. 

As used interchangeably herein an "FDRG activity", "biological activity of 
FDRG" or "functional activity of FDRG", refers to an activity exerted by a FDRG 
protein, polypeptide or nucleic acid molecule on a FDRG responsive cell as determined 
in vivo, or in vitro, according to standard techniques. In one embodiment, a FDRG 

25 activity is a direct activity, such as an association with a cell-surface protein (e.g. , a 
FDRG receptor or FDRG binding protein). In another embodiment, a FDRG activity is 
an indirect activity, such as the modulation of the activity of a second/protein (e.g. a 
cellular signaling molecule) mediated by interaction of the FDRG protein with a cell 
surface protein. In a preferred embodiment, a FDRG activity is at least one or more of 

30 the following activities: (i) interaction of a FDRG protein in the extracellular milieu 
with a non-FDRG protein molecule (e.g., a FDRG receptor) on the surface of the same 
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cell which secreted the FDRG protein molecule; (ii) interaction of a FDRG protein in the 
extracellular milieu with a non-FDRG protein molecule (e.g., a FDRG receptor) on the 
surface of a different cell from that which secreted the FDRG protein molecule; (Hi) 
complex formation between a FDRG protein and a FDRG receptor; (iv) complex. 
5 formation between a FDRG protein and non-FDRG receptor; (v) modulation of signal, 
transduction via a cell surface protein (e.g., a FDRG receptor); (vi) modulation of the 
expression of adipogenic genes; (vii) modulation of a PP ARy function or modulation by 
a PPARy function; (viii) modulation of a cellular response to a PPARy ligand (e.g. , 
thiazolidinediones) and (ix) interaction of a FDRG protein with a second protein in the 
10 extracellular milieu (e.g., a FDRG binding protein). 

In yet another preferred embodiment, a FDRG activity is at least one or more of 
the following activities: (i) activation of a FDRG-dependent signal transduction 
pathway; (ii) modulation of angiogenesis; (iii) modulation of a hematopoiesis; (iy) 
modulation of the proliferation, development or differentiation of a FDRG-expressing 
15 cell or FDRG-receptor expressing cell (e.g., mediation of growth and/or differentiation 
of adipocytes, for example, white adipocytes or brown adipocytes); (v) modulation of 
the proliferation, development or differentiation of a non- FDRG-expressing cell or 
FDRG-expressing cell; (vi) modulation of the homeostasis of a FDRG-expressing cell; 
(vii) modulation of insulin sensitivity and/or insulin responsiveness; (viii) modulation of 
20 insulin secretion; (ix) modulation of cell recruitment; (x) modulation of the homeostasis 
of a non-FDRG-expressing cell; and (xi) modulation of glucose metabolism. 

The present invention is based, at least in part, on the discovery of a novel 
secreted protein (the human FDRG protein having the amino acid sequence of SEQ ID 
NO:2) which was isolated from an aortic endothelial cDNA library, is highly expressed 
25 in placenta and fat, and has homology to angiopoietin. The present invention is based 
further in part on the discovery of a murine FDRG protein having the amino acid 
sequence of SEQ ID NO:12 which was isolated in a screen for transcriptional targets of 
PPARy, a nuclear hormone receptor that can act as an adipogenic transcription factor 
and a potent regulator of systemic insulin action. 



WO 99/45135 PCT/US99/04548 

- 15- 

PPARy is a nuclear hormone receptor which exists in two isoforms (y 1 and y2) 
formed by alternative splicing (Zhii et al (1995) PNAS 92:7921-7925) and which 
appears to function as both a direct regulator of many fat-specific genes and also as a 
"master" regulator that can trigger the entire program of adipogenesis (Spiegelman and 
5 Flier ( 1 996) Cell 87:377-389). PPARy forms a heterodimer with RXRa and has been 
shown to bind directly to well characterized fat-specific enhancers from the adipocyte P2 
(aP2: Tontonoz (1994) Genes Dev. 8:1224-1234) and phlosphoenblpyruvate 
carboxylase (PEPCK) genes (Tontonoz (1994) Afo/. Cell Biol. 15:351-357). The 
identification of FDRG as a target of PPARy places it in an important biological context, 

1 0 particularly in light of the prominent role the latter molecule plays in adipocyte 
differentiation and insulin signaling. 

Murine FDRG is higly expressed in brown* and white fat and placenta and 
expression is acutely induced in cultured cells by PPARy ligands such as 
thiazolidinediones (TZDs). Murine FDRG expression is also upregiilated in tissues of 

1 5 rodents chronically treated with TZDs. Furthermore, FDRG mRNA levels are elevated 
in murine models of obesity including the ob/ob and db/db mice, and are regulated on an 
acute basis by nutrition. 

Accordingly, in another preferred embodiment, a FDRG activity is at least one or 
more of the following activities: (1) maintenance of energy homeostasis (e.g., regulating 

20 the balance and/or imbalance between energy storage and energy expenditure, for . 
example, increasing/decreasing energy expenditure); (2) regulation of adaptive 
thermogenesis (e.g. , regulation of the biogenesis of mitochondria, regulation of the 
% expression of mitocrondrial enzymes, regulation of expression of uncoupling proteins; 
(3) regulation of adiposity; (4) modulation of the efficiency of energy storage; (5) 

25 regulation of appetite; (6) regulation of vasculogenesis (/. e. , blood vessel formation); (7) 
regulation of tumor angiogenesis; (8) regulation of wound healing; (9) 
expansion/reduction of tumor mass; and (10) expansion/reduction of fat mass. 

Moreover, in another embodiment of the invention, a FDRG molecule or > 
preferably, a FDRG or FDRG receptor modulator, is useful forregulating, preventing 

30 and/or treating at least one or more of the following diseases or disorders: (1) obesity 



WO 99/45135 • . PCT/US99/04548 

" -16- 

(e.g, excessive storage of energy as fat or a chronic imbalance between energy intake 
and expenditure); (2) diseases or disorders that accompany obesity (e.g. , 
cardiovasculalar disorders); (3) diabetes {e.g., non : insulin dependent diabetes mellitus); 
(4) disorders of energy homeostasis; (5) metabolic abnormalities typical of obesity (e.g., 
hyperinsulinemia); (6) insulin resistance; (7) disorders associated with lipid metabolism 
(e.g. , cachexia); and (8) disorders, associated with abnormal vasculogenesis (e.g., cancers 
including, but not limited to, cancers of the epithelia (e.g. , carcinomas of the pancreas, 
stomach, liver, secretory glands (e.g., adenocarcinoma), bladder, lung, breast, skin (e.g., 
fibromatosis or malignant melanoma), reproductive tract including prostate gland, ovary, 
cervix and uterus); cancers of the hematopoietic and immune system (e.g., leukemias 1 
and lymphomas); cancers of the central nervous, brain system and eye (e.g., gliomas, 
neuroblastoma and retinoblastoma); and cancers of connective tissues, bone, muscles 
and vasculature (e.g., hemangiomas and sarcomas)). In another embodiment, a FDRG 
molecule or preferably, a FDRG or FDRG receptor modulator is an antidiabetic agent or 
an insulin sensitizing agent. In yet another embodiment, a FDRG molecule or 
preferably, a FDRG or FDRG receptor modulator is an angiogenic factor. 

Accordingly, another embodiment of the invention features isolated FDRG 
proteins and polypeptides having a FDRG activity. Preferred FDRG proteins have a C- 
terminal fibrinogen-like domain and a FDRG activity. In another embodiment, the 
FDRG protein has an N-terminal unique domain, a C-terminal fibrinogen-like domain, 
and a FDRG activity. In another preferred embodiment, the FDRG protein has at least 
at least a C-terminal fibrinogen-like domain, an N-terminal unique domain, a FDRG 
activity, and an amino acid sequence sufficiently homologous to an amino acid sequence 
of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO: 12 or SEQ ID NO: 15. 

In a particularly preferred embodiment, the FDRG protein and nucleic acid 
molecules of the present invention are human FDRG molecules. A human FDRG 
cDNA molecule was obtained from a human aortic endothelial cDNA library as 
described in Example 1 . The nucleotide sequences of the isolated human FDRG cDNA 
and the .predicted amino acid sequence of the human FDRG protein are shown in Figure 
1 and in SEQ ID NOs: 1 and 2, respectively. In addition, the nucleotide sequence 
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corresponding to the coding region of the human FDRG cDNA is represented as SEQ ID 
NO:3. 

The human FDRG I cDNA setforth in SEQ ID NO: 1 , is approximately 1 894 
nucleotides in length and encodes a protein which is approximately 406 amino acid 
5 residues in length : (SEQ ID NO:2). The human FDRG protein contains a signal 
sequence, an N-tefminal unique domain, and a C-terminal fibrinogen-Iike domain, as 
defined herein. A FDRG N-terminal unique domain can be found at least, for example, 
from about amino acids 1-183 of SEQ ID NO:2 and, for example, from about amino 
acids 1-158 of SEQ ID NO:5. A FDRG C-terminal fibrinogen-like domain can be found 

1 0 at least, for example, from about amino acids 1 84-400 of SEQ ID NO:2 and, for 

example, from about amino acids 1 59-374 of SEQ ID NQ:5. A signal sequence can be 
found at least, for example, from about amino, acids 1-25 of SEQ ID NO:2. The 
prediction of such a signal peptide can be made, for example, utilizing the computer , 
algorithm SIGNALP (Henrik, et al (1997) Protein Engineering 10:1-6). 

15 In another particularly preferred embodiment, the FDRG protein and nucleic acid 

molecules of the present invention are murine FDRG molecules. A murine FDRG 
cDNA molecule was obtained from a screen designed to delect nucleic acids regulated 
by PPARy, a fat-specific transcription factor, as described in Example 4. The nucleotide 
sequences of the isolated murine FDRG cDNA and the predicted amino acid sequence of 

20 the murine FDRG protein are shown in Figure 4 and in SEQ ID NOs: 1 1 and 1 2, 

respectively. In addition, the nucleotide sequence corresponding to the coding region of 
the murine FDRG cDNA is represented as SEQ ID NO: 1 3. 

The murine FDRG 1 cDNA set forth in SEQ ID NO:l 1, is approximately 1 893 
nucleotides in length and encodes a protein which is approximately 410 amino acid 

25 residues in length (SEQ ID NO: 12). The murine FDRG protein contains a signal 
sequence, an N-terminal unique domain, and a C-terminal fibrinogen-like domain, as 
defined herein. A FDRG N-terminal unique domain can be found at least, for example, 
from about amino acids 1 -1 87 of SEQ ID NO: 12 and, for example, from about amino 
acids 1 -1 62 of SEQ ID NO: 15. A FDRG C-terminal fibrinogen-like domain can be 

30 found at least, for example, from about amino acids 1 88-404 of SEQ ID NO: 1 2 and, for 
example, from about amino acids 163-379 of SEQ ID NO:15. A signal sequence can be 
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found at least, for example/from about amino acids 1,-23 of SEQ ID NO: 12, The 
' prediction of such a signal peptide can be made, for example, utilizing the computer 
algorithm SIGNALP (Henrik, et a//(1997) Protein Engineering 10:1-6). 

FDRG mRNA expression was particularly high in placenta and fat as determined 
5 by norther blot analysis of various human and mouse tissues (See Examples 2 and 8). 

An alignment of the amino acid sequences of human FDRG, murine FDRG, 
human angiopoietin 2 (Genbank™ Accession No. AF004327), murine angiopoietin 2 
(Genbank™ Accession No. AF004326), and ficoiin (Genbank™ Accession No. LI2345) 
is shown in Figure 3. (The alignment was generated using MegAlign™ sequence 
10 alignment software. The initial pairwise alignment step was performed using a Lipman 
Pearson algorithm with a K-tuple of 1, a GAP penalty of 3, a windowof 5, and 
diagonals saved set to 5. The multiple alignment step was performed 
algorithm with a PAM 250 residue weight Table, a GAP penalty of 10, and a GAP 
length penalty of 10.) 

15 Various aspects ofthe invention are described in further detail in the following 

subsections: 

I. Isolated Nucleic Acid Molecules 

One aspect ofthe invention pertains to isolated nucleic acid molecules that 
20 encode FDRG proteins or biologically active portions thereof, as well as nucleic acid . 

fragments sufficient for use as hybridization probes to identify 
acids (e.g., FDRG mRNA) and fragments for use as PCR primers for the amplification 
or mutation of FDRG nucleic acid molecules. As used herein, the term "nucleic acid 
molecule" is intended to include DN A molecules (e.g., cDNA or genomic DNA) and 

25 RNA molecules (e.g., mRNA) and analogs of the DNA or RNA generated using 
nucleotide analogs. The nucleic acid molecule can be single-stranded of double- 
stranded, but preferably is double-stranded DNA. 

An "isolated" nucleic acid molecule is one which is separated from. other nucleic 
acid molecules which are present in the natural source ofthe nucleic acid. Preferably, an 

30 "isolated" nucleic acid is free of sequences which naturally flank the nucleic acid (i.e., 
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sequences located at the 5' and 3' ends of the nucleic acid) in the genomic DNA of the 
organism from which the nucleic acid is derived. For example, in various embodiments, 
the isolated FDRG nucleic acid molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 
kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid 
5 molecule in genomic DN A of the cell from which the nucleic acid is derived. Moreover, 
an "isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free 
of other cellular material, or culture medium when produced by recombinant techniques, 
or substantially free of chemical precursors or other chemicals when chemically 
synthesized. 

1 0 A nucleic acid molecule of the present invention, e.g. , a nucleic acid molecule 

having the nucleotide sequence of SEQ ID NO: 1, SEQ ID NO:3, SEQ ID NO:4, SEQ ID 
NO:l 1, SEQ ID NO:i3, SEQ ID NO:14, the nucleotide sequence of the DNA insert of 
the plasmid deposited with ATCC as Accession Number 98632, or a portion thereof, can 
be isolated using standard molecular biology techniques and the sequence information 

1 5 provided herein. Using all or portion of the nucleic acid sequence of SEQ ID NO: 1 , 
SEQIDNO:3,SEQIDNO:4,SEQIDNO:ll,SEQIDNO:13,SEQIDNO:14,orthe . 
nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98632, as a hybridization probe, FDRG nucleic acid molecules can, 
be isolated using standard hybridization and cloning techniques {e.g., as described in 

20 Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual 
2nd, ed. t Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989). 

Moreover, a nucleic acid molecule encompassing all or a portion of SEQ ID * 
NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:l 1, SEQ ID NO:13, SEQ ID NO:14, 

25 or the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98632, can be isolated by the polymerase chain reaction (PCR) using 
synthetic oligonucleotide primers designed based upon the sequence of SEQ ID NO:l, 
SEQ ID NO:3, SEQ ID NO:4, or the nucleotide sequence ofthe DNA insert of the 
plasmid deposited with ATCC as Accession Number 98632. 
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A nucleic acid of the invention can be amplified using cDNA, mRNA or 
alternatively, genomic DNA, as a template and appropriate oligonucleotide primers 
according to standard PCR amplification techniques. The nucleic acid so amplified can 
be cloned into an appropriate vector and characterized by DNA sequence analysis. 
5 Furthermore, oligonucleotides'corresponding to FDRG nucleotide sequences can be 
■ prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer. 

; In a preferred embodiment, an isolated nucleic acid molecule of the invention ." 
comprises the nucleotide sequence shown in SEQ ID NO:l. The sequence of SEQ ID 
NO:l corresponds to a human FDRG cDNA. This cDNA comprises sequences 
10 encoding a human FDRG protein (i.e., "the coding region", from nucleotides 166-1386), 
as well as 5' untranslated sequences (nucleotides 1 to 1 65) and 3' untranslated sequences 
(nucleotides 1 3 87 to 1 894). Alternatively, the nucleic acid molecule can comprise only 
the coding region of SEQ ID NO:l (e.g., nucleotides 166 to 1386). 

In another preferred embodiment, an isolated nucleic acid molecule of the, 
15 invention comprises the nucleotide sequence shown in SEQ ID NO:3. The sequence of 
SEQ ID NO:3 corresponds to the human FDRG cDNA. This cDNA comprises 
sequences encoding the human FDRG protein (i.e., "the coding region"; from 
nucleotides 1 to 1218 of SEQ ID N0:3). o 

• In another preferred embodiment, an isolated nucleic acid molecule of the 
20 invention comprises the nucleotide sequenceshown in SEQ ID NO:4. The sequence of 
SEQ ID NO:4 corresponds to human FDRG cDNA. This cDNA comprises sequences 
encoding the mature FDRG protein (i.e., from nucleotides 241-1386 of SEQ ID NO:l 
after the signal sequence has been cleaved). - * • ■ 

In yet another preferred embodiment, an isolated nucleic acid molecule of the 
25 . invention comprises the nucleotide sequence .of the DNA insert of the plasmid deposited 
with ATCC as Accession Number 98632. A plasmid containing the full length 
nucleotide sequence encoding FDRG was deposited with the American Type Culture 
Collection (ATCC), Rockville, Maryland, on January 14, 1997 and assigned Accession 
. Number 98632. . 
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In yet another preferred embqdiment,.an isolated nucleic acid molecule of the 
invention comprises the nucleotide sequence shown in SEQ ID NO: 13. The sequence of 
SEQ ID NO: 1 3 corresponds to a murine FDRG cDNA. This cDNA comprises 
sequences encoding a murine FDRG protein (i.e., "the coding region", from nucleotides 
5 149-1 378), as well as 5' untranslated sequences (nucleotides 1 to 148) and 3' 
untranslated sequences (nucleotides 1379 to 1893). Alternatively, the nucleic acid 
molecule can comprise only the coding region of SEQ ID NO: 1 3 (e.g. , nucleotides 1 49 
to 1378). 

In another preferred embodiment, an isolated nucleic acid molecule of the ' 

1 0 invention comprises the nucleotide sequence shown in SEQ ID NO: 13. The sequence of 
SEQ ID NO: 1 3 corresponds to the murine FDRG cDNA. This cDNA comprises 
sequences encoding the murine FDRG protein (/.*., "the coding region", from 
nucleotides 1 to 1230 of SEQ ID NO:13). 

In another preferred embodiment, an isolated nucleic acid molecule of the 

1 5 invention comprises the nucleotide sequence shown in SEQ ID NO: 1 4. The sequence of 
SEQ ID NO: 14 corresponds to murine FDRG cDNA. This cDNA comprises sequences 
encoding the mature FDRG protein (i.e., from nucleotides 710-1380 of SEQ ID NO:ll 
after the signal sequence has been cleaved). _ 
In another preferred embodiment, an isolated nucleic acid molecule of the 

20 invention comprises a nucleic acid molecule which is a complement of the nucleotide 
sequence shown in SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:l 1, SEQ 
ID NO: 13, SEQ ID NO: 14, the nucleotide sequence of the DNA insert of the plasmid 
deposited with ATCC as Accession Number 98(532, or a portion of any of these 
nucleotide sequences. A nucleic acid molecule which is complementary to the 

25 nucleotide sequence shown in SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID NO:4, SEQ ID . 
NO:l 1, SEQ ID NO: 13, SEQ ID NO: 14, or the nucleotide sequence of the DNA insert 
of the plasmid deposited with ATCC as Accession Number 98632 is one which is 
sufficiently complementary to the nucleotide sequence shown in SEQ ID NO:l, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:l 1, SEQ ID NO:13, SEQ ID NO:14, or the nucleotide 

30 sequence of the DNA insert of the plasmid deposited with ATCC as Accession Number 
98632 such that it can hybridize to the nucleotide sequence shown in SEQ ID NO:l, 
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SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:l 1, SEQ ID NO:13„SEQ ID NO:14,.or the 
nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98632, thereby forming a stable duplex. 

In still another preferred embodiment, an isolated nucleic acid molecule of the 
5 present invention comprises a nucleotide sequence which is at least about 60-65%, 
preferably at least about 70-75%, more preferable at least about 80-85%, and even more 
preferably at least about 90-95% or more homologous to the nucleotide sequence shown 
in SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:ll, SEQ ID NO:13, SEQ 
ID NO: 14, the nucleotide sequence of the DNA insert of the plasmid deposited with 
10 ATCC as Accession Number 98632, or a portion of any of these nucleotide sequences. 
In still another preferred embodiment, an isolated nucleic acid molecule of the present 
invention comprises a nucleotide sequence which is at least about 61%, 62%, 63%, 64%, 
65%, 66%, 67%, 68%, or 69% homologous, preferably at least about 71%, 72%, 73%, 
74%, 75%, 76%, 77%, 78%, or 79% homologous, more preferably at least about 81%, 
1 5 82%, 83%, 84%, 85%, 86%, 87%, 88%, or 89% homologous, and even more preferably 
at least about 91%, 92%, 93%, 94%, 95%,96%, 97%, 98%, or 99% or more 
homologous to the nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:l 1, SEQ ID NO: 13, SEQ ID NO:14, the nucleotide sequence of the 
DNA insert of the plasmid deposited with ATCC as Accession Number 98632, or a 
20 portion of any of these nucleotide sequences. 

Moreover, the nucleic acid molecule of the invention can comprise only a portion 
of the nucleic acid sequence of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID 
NO:l 1, SEQ ID NO:13, SEQ ID NO:14, or the nucleotide-sequence of the DNA insert 
of the plasmid deposited with ATCC as Accession Number 98632, for example a 
25 fragment which can be used as a probe or primer or a fragment encoding a biologically 
active portion of FDRG. The nucleotide sequence determined from the cloning of the 
human and murine FDRG genes allows for the generation of probes and primers 
designed for use in identifying and/or cloning FDRG homologues in other cell types, 
e.g. , from other tissues, as well as FDRG homologues from other mammals. The 
30 probe/primer typically comprises substantially purified oligonucleotide. The 

oligonucleotide typically comprises a region of nucleotide sequence that hybridizes 
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under stringent conditions to at least about 12, preferably about 25, more preferably 
about 40, 50 or 75 consecutive nucleotides of SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:l 1, SEQ ID NO:13, SEQ ID NO: 14, or the DNA insert of the - 
plasmid deposited with ATCC as Accession Number 98632 sense, or an anti-sense 
5 sequence of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:l 1, SEQ ID 
„ NO:13, SEQ ID NO:14, or the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98632, or of a naturally occurring mutant of SEQ ID NO: 1 , SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO: 1 1, SEQ ID NO: 13, SEQ ID NO:14, or the DNA 
insert of the plasmid deposited with ATCC as Accession Number 98632. 

10 Probes based on the human FDRG nucleotide sequence can be used to detect 

transcripts or genomic sequences encoding the same or homologous proteins. In 
preferred embodiments, the probe further comprises a label group attached thereto; e.g. , 
the label group can be a radioisotope, a fluorescent compound, an enzyme, or an enzyine 
co-factor. ' Such probes can be used as a part of a diagnostic test kit for identifying cells 

15 or tissue which misexpress a FDRG protein, such as by measuring a level of a FDRG- 
encoding nucleic acid in a sample of cells from a subject e.g., detecting FDRG mRNA 
levels or determining whether a genomic FDRG gene has been mutated or deleted. 

A nucleic acid fragment encoding a "biologically active portion of FDRG" can . 
be prepared by isolating a portion of SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID NO:4, SEQ 

20 ID NO:l 1, SEQ ID NO: 13, SEQ ID NO: 14, or the DNA insert of the plasmid deposited 
with ATCC as Accession Number 98632 which encodes a polypeptide having a FDRG 
biological activity (the biological activities of the FDRG proteins have previously been 
described), expressing the encoded portion of FDRG protein (e.g. , by recombinant 
expression in vitro) and assessing the activity of the encoded portion of FDRG. / 

25 The invention further encompasses nucleic acid molecules that differ from the 

nucleotide sequence shown in SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID NO:4 and from the 
nucleotide sequences of the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98632, or portions thereof, due to degeneracy of the genetic code and 
thus encode the same FDRG protein as that encoded by the nucleotide sequence shown 

30 in SEQ ID NO:l, SEQ ID NO:3 or SEQ ID NO:4 or by the nucleotide sequence of the 
DNA insert of the plasmid deposited with ATCC as Accession Number 98632. In . 
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* another embodiment, an isolated nucleic acid molecule of theinvention has a nucleotide 
sequence encoding a protein having an amino acid sequence shown in SEQ ID NO:2, . . » . 
SEQ ID NO:5, SEQ ID NO:12 or SEQ ID NO:15. 

In addition to the human FDRG nucleotide sequences shown in SEQ ID NO: 1 , 

5 SEQ ID NO:3 and SEQ ID NO:4, respectively, it will be appreciated by those skilled in 
the art that DN A sequence polymorphisms that lead to changes in the amino acid 
sequences of FDRG may exist within a population (e.g. , the human population). Such 
genetic polymorphism in the FDRG gene may exist among individuals within a 
population due to natural allelic variation. As used herein, the terms "gene" and 

1 0 "recombinant gene" refer to nucleic acid molecules comprising an open reading frame 
encoding a FDRG protein, preferably a mammalian FDRG protein. Such natural allelic 
variations can typically result in 1-5% variance in the nucleotide sequence of the FDRG 
gene. Any and all such nucleotide variations and resulting amino acid polymorphisms in 
FDRG that are the result of natural allelic variation and that do not alter the functional 

15 activity of FDRG are intended to be within the scope of the invention. 

V , Moreover, nucleic acid molecules encoding FDRG proteins from other species, 
and thus which have a nucleotide sequence which differs from the human sequences of 
SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:l 1, SEQ ID NO:13, SEQ ID 
NO: 14, or the nucleotide sequence of the DNA insert of the plasmid deposited with 

20 ATCC as Accession Number 98632 are intended to be within the scope of the invention. 
Nucleic acid molecules corresponding to natural allelic variants and homologues of the 
FDRG cDN A of the invention can be isolated based on their homology to the human 
FDRG nucleic acids disclosed herein using the human cDNA, or portions thereof, as a 
hybridization probe according to standard hybridization technique 

25 hybridization conditions. For example, a viral FDRG cDNA can be isolated based on its 

homology to human FDRG. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 

invention is at least 1 5 nucleotides in length and hybridizes under stringent conditions to > 
the nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO:l, SEQ ID 
30 NO:3,SEQIDNO:4,SEQIDNO:ll,SEQIDNO:13,SEQIDNO:14,ortheDNA 
insert of the plasmid deposited with ATCC as Accession Number 9861 3. In other 
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embodiment, the nucleic acid is at least 30, 50, 100, 250 or 500 nucleotides in length. 
As used herein, the term "hybridizes under stringent conditions" is intended to describe 
conditions for hybridization and washing under which nucleotide sequences at least 60% 
homologous to each other typically remain hybridized to each other. Preferably, the 
5 conditions, are such that sequences at least about 65%, more preferably at least about 
70%, and even more preferably at least about 75% homologous to each other typically 
remain hybridized to each other. Such stringent conditions are known to those skilled in 
the art and can be found in Current Protocols in Molecular Biology, John Wiley,& Sons, 
N.Y. (1989), 6.3.1-6.3.6. A preferred, non-limiting example of stringent hybridization 

1 0 conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2 X SSC, 0.1% SDS at 50-65°C. Preferably, an 
isolated nucleic acid molecule of the invention that hybridizes under stringent conditions 
to the sequence of SEQ ID NO: 1 , SEQ ID NO:3 or SEQ ID NO:4 corresponds to a 
naturally-occurring nucleic acid molecule. As used herein, a "naturally-occuiTing" 

1 5 nucleic acid molecule refers to an RN A or DN A molecule having a nucleotide sequence 
that occurs in nature {e.g. , encodes a natural protein). , t . . * 

In addition to naturally-occurring allelic variants of the FDRG sequence that may 
exist in the population, the skilled artisan will further appreciate that changes can be 
introduced by mutation into the nucleotide sequence of SEQ ID NO: 1 , SEQ ID NO:3, 

20 SEQ ID NO:4, or the DNA insert of the plasmid deposited with ATCC as Accession 
Number 98632, thereby leading to changes in the amino acid sequence of the encoded 
FDRG protein, without altering the functional ability of the FDRG protein. For 
example, nucleotide substitutions leading to amino acid substitutions at "nonessential" 
amino acid residues can be made in the sequence of SEQ ID NO:l , SEQ ID NO:3, SEQ 

25 ID NO:4, SEQ ID NO:l 1, SEQ ID NO:13, SEQ ID NO:14, or the DNA insert of the 
plasmid deposited with ATCC as Accession Number 98632. A "non-essential" amino 
acid residue is a residue that can be altered from the wild-type sequence of FDRG (e.g., 
the sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:12 or SEQ ID NO:15) 
without altering the biological activity, whereas an "essential" amino acid residue is 

30 required for biological activity. For example, amino acid residues that are conserved 
among the FDRG proteins of the present invention, are predicted to be particularly 
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unamenable to alteration. Furthermore, amino acid residues that are conserved between 
FDRG protein and other proteins having fibrinogen-like domains are not likely to be 
amenable to alteration. 

* Accordingly, another aspect of the invention pertains to nucleic acid molecules 
5 encoding FDRG proteins that contain changes in amino acid residues that are not 

essential for.activity. Such FDRG.proteins differ in amino acid sequence from SEQ ID 
NO:2, SEQ ID NO:5, SEQ ID NO:12 or SEQ ID NO:15 yet retain biological activity. 
In one embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence 
encoding a protein, wherein the protein comprises an amino acid sequence at least about 
1 0 60-65% homologous to the amino acid sequence of SEQ ID NO:2. Preferably, the 
protein encoded by the nucleic acid molecule is at least about 70-75% homologous to 
SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:12, or SEQ ID NO:15, more preferably at 
least about 80-85% homologous to SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:12, or 
SEQ ID NO: 1 5, and even more preferably at least about 90-95% homologous to SEQ ID 
15 NO:2, SEQ ID NO:5, SEQ ID NO:12, or SEQ ID NO:15. 

, In an alternative embodiment, the isolated nucleic acid molecule comprises a 
nucleotide sequence encoding a protein, wherein the protein comprises an amino acid 
sequence at least about 61%, 62%, 63%; 64%, 65%, 66%, 67%, 68%, or 69% 
homologous, preferably at least about 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, or 
20 79% homologous, more preferably at least about 81%, 82%, 83%, 84%, 85%, 86%, 
87%, 88%, or 89% homologous, and even more preferably at least about 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98%, or 99% or more homologous to SEQ ID NO:2, SEQ 
ID NO:5, SEQ ID NO:12, or SEQ ID NO:15. 

An isolated nucleic acid molecule encoding a FDRG protein homologous to the 
25 protein of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:12 or SEQ ID NO:15 can be 
created by introducing one or more nucleotide substitutions, additions or deletions into 
the nucleotide sequence of SEQ ID NO: 1 , SEQ ID NO:3 or SEQ ID NO:4, or into the 
nucleotide sequence of the DN A insert of the plasmid deposited with ATCC as 
Accession Number 98632, such that one or more amino acid substitutions, additions or 
30 deletions are introduced into the encoded protein. Mutations can be introduced into * 
SEQIDNO:!, SEQIDNO:3, SEQIDNO:4, SEQ ID NO: 1.1, SEQIDNO:13, SEQ ID 



WO 99/45135 PCT/US99/04548 

-27- 

NO: 1 4, or the DN A insert of the plasmid deposited with ATCC as Accession Number 
98632 by standard techniques, such as site-directed mutagenesis and PCR-mediated 
mutagenesis. Preferably, conservative amino acid substitutions are made at one or more 
predicted non-essential amino acid residues. A "conservative amino acid substitution" is 
5 one in which the amino acid residue is replaced with an amino acid residue having a 
similar side chain. Families of amino acid residues having similar side chains have been 
defined in the art. These families include amino acids with basic side chains (e.g. , 
lysine, arginine, histidine), acidic side chains (e.g. , aspartic acid, glutamic acid), 
uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, 
10 tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, 
proline, phenylalanine, methionine, tryptophan)/ beta-branched side chains (e.g. , 
threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, 
tryptophan, histidine). Thus, a predicted nonessential amino acid residue in FDRG is 
preferably replaced with another amino acid residue from the same side chain family. 
1 5 Alternatively, in another embodiment, mutations can be introduced randomly along all 
or part of a FDRG coding sequence, such as by saturation mutagenesis, and the resultant 
mutants can be screened for FDRG biological activity to identify mutants that retain 
activity. Following mutagenesis of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ 
ID NO:l 1, SEQ ID NO:13, SEQ ID N0.14, or the DNA insert of the plasmid deposited 
20 with ATCC as Accession Number 98632, the encoded protein can be expressed 
recombinantly and the activity of the protein can be determined. 

In a preferred embodiment, a mutant FDRG protein can be assayed for (1) 
activation of a FDRG-dependent signal transduction pathway; (2) modulation of 
secretion of a non-IL-17 cytokine; (3) modulation of secretion of IL- 17; (4) modulation , 
25 of surface expression of a cellular adhesion molecule; (5) modulation of a 

proinflammatory cytokine; (6) modulation of a hematopoietic cytokine; (7) modulation • 
of the development or differentiation of a FDRG-expressing cell; (8) modulation of the 
development or differentiation of a non- FDRG-expressing .cell; (9) modulation of the 
homeostasis of a FDRG-expressing cell; (10) modulation of the homeostasis of a non- 
30 FDRG-expressing cell; (1 1) modulation of adipocyte development and physiology; and 
(12) modulation of insulin signaling. " ; * 
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In addition to the nucleic acid molecules encoding FDRG proteins described 
above, another aspect of the invention pertains to isolated nucleic acid molecules which 
are antisense thereto. An "antisense" nucleic acid comprises a nucleotide sequence 
which is complementary to a "sense" nucleic acid encoding a protein, e.g. , 

5 complementary to the coding strand of a double-stranded cDNA molecule or 

complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can 
hydrogen bond to a sense nucleic acid. ,The antisense nucleic acid can be 
complementary to an entire FDRG coding strand, or to only a portion thereof. In one 
embodiment, an antisense nucleic acid molecule is antisense to a "coding region" of the 

1 0 coding strand of a nucleotide sequence encoding FDRG. The term "coding region" 
refers to the region of the nucleotide sequence comprising codons which are translated 
into amino acid residues (e.g., the coding region of human FDRG corresponds to SEQ 

ID NO:3). In another embodiment, the antisense nucleic acid molecule is antisense to a 
"noncoding region" of the coding strand of a nucleotide sequence encoding FDRG. The 
15 term "noncoding region" refers to 5' and 3' sequences which flank the coding region that 
are not translated' into amino acids (i.e., also referred to as S and 3' untranslated 
regions). 

Given the coding strand sequences encoding FDRG disclosed herein (e.g., SEQ 
ID NO:3 or SEQ ID NO:l 3), antisense nucleic acids of the invention can be designed 

20 according to the rules of Watson and Crick base pairing. The antisense nucleic acid 
molecule can be complementary to the entire coding region of FDRG mRNA, but more 
preferably is an oligonucleotide which is antisense to only a portion of the coding or 
noncoding region of FDRG mRNA. For example, the antisense oligonucleotide can be 
complementary to the region surrounding the translation start site of FDRG mRNA. An 

25 . antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 
50 nucleotides in length. An antisense nucleic acid of the invention can be constructed 
using chemical synthesis and enzymatic ligation reactions using procedures known in 
the art. For example, an antisense nucleic acid (e.g. , an antisense oligonucleotide) can 
be chemically synthesized using naturally occurring nucleotides or variously modified 

30 " nucleotides designed to increase the biological stability of the molecules or to increase 
the physical stability of the duplex formed between the antisense and sense nucleic 
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acids, e.g., phosphorothioate derivatives and acridine substituted nucleotides can be 
used. Examples of modified nucleotides which can be used to generate the antisense 
nucleic acid include 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
hypoxanthine, xantine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5- 
5 carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosihe,' 

N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine ? 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 

10 beta-D-mannosylqueosine, 5-methoxycarboxymethyluracil, 5-methoxyuracil, 
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, 
pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 
5-methyluracil, uracil-5- oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 

1 5 2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced 

biologically using an expression vector into which a nucleic acid has been subcloned in 
an antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of 
an antisense orientation to a target nucleic acid of interest, described further in the 
following subsection). 

20 The antisense nucleic acid molecules of the invention are typically administered 

to a subject or generated in situ such that they hybridize with or bind to cellular mRNA 
and/or genomic DNA encoding a FDRG protein to thereby inhibit expression of the 
protein, e.g. , by inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for example, in the 

25 case of an antisense nucleic acid molecule which binds to DNA duplexes, through 
specific interactions in the major groove of the double helix. An example of a route of 
administration of antisense nucleic acid molecules of the invention include direct 
injection at a tissue site. Alternatively, antisense nucleic acid molecules can be modified 
to target selected cells and then administered systemically. For example, for systemic 

30 administration, antisense molecules can be modified such that they specifically bind to 
receptors or antigens expressed on a selected cell surface, e.g., by linking the antisense 
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nucleic acid molecules to peptides or antibodies which bind to cell surface receptors or 
antigens. The antisense nucleic acid molecules can also be delivered to cells using the 
vectors described herein. To achieve sufficient intracellular concentrations of the 
antisense molecules, vector constructs in which the antisense nucleic acid molecule is 
5 placed under the control of a strong pol II or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention 
is an a-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual (J-units, the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids. 
10 Res. 1 5:6625-6641). The antisense nucleic acid molecule can also comprise a 2'-o- 
methylribonucleotide(Inouee/ o/. (198.7) Nucleic Acids Res. 15:6131-6148) or a 
chimeric RNA-DNA analogue (Inoue et al. (1987) FEBS Lett. 215:327-330). 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which are 
1 5 capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they 
have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes 
(described in Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to 
catalytically cleave FDRG mRNA transcripts to thereby inhibit translation of FDRG 
mRNA. A ribozyme having specificity for a FDRG-encoding nucleic acid can be 
20 designed based upon the nucleotide sequence of a FDRG cDNA disclosed herein (i.e. , 
SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ IDNO:l I, SEQ ID NO:13, SEQ ID 
NO:14, or the DNA insert of the plasmid deposited with ATCC as Accession Number 
98632). For example, a derivative of a Tetrahymena L-19 TVS'RNA can be constructed 
in which the nucleotide sequence of the active site is complementary to the nucleotide 
25 sequence to be cleaved in a FDRG-ehcoding mRNA. See, e.g. , Cech et al. U.S. Patent 
No. 4,987,071; and Cech et al. U.S. Patent No. 5,1 16,742. Alternatively, FDRG mRNA 
can be used to select a catalytic RNA having a specific ribonuclease activity from a pool 
of RNA molecules. See, e.g., Bartel, D. and Szostak, J.W.,(1993) Science 261:1411- 
1418. , •- 
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Alternatively, FDRG gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the FDRG (e.g. , the FDRG 
promoter and/or enhancers) to form triple helical structures that prevent transcription of 
the FDRG gene in target cells. See generally, Helene, C. (1991) Anticancer Drug Des. 
5 6(6):569-84; Helene, C. et al (1992) Ann. K Y. Acad Sci. 660:27-36; and Maher, L.J. 
(1992) Bioassays 14(12):807-15. 

In preferred embodiments, the nucleic acids of FDRG can be modified at the 
base moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, 
hybridization, or solubility of the molecule. For example, the deoxyribose phosphate 

1 0 backbone of the nucleic acids can be modified to generate peptide nucleic acids (see 
Hyrup B. et al (1996) Bioorganic & Medicinal Chemistry 4 (1): 5-23). As used herein, 
the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA 
mimics, in which the deoxyribose phosphate backbone is replaced by a pseudopeptide 
backbone and only the four natural nucleobases are retained. The neutral backbone of . 

1 5 PNAs has been shown to allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PN A oligomers can be performed 
using standard solid phase peptide synthesis protocols as described in Hyrup B. et al. 
(1996) supra; Perry-O'Keefe et al PNAS 93: 14670-675. 

PNAs of FDRG can be used therapeutic and diagnostic applications. For 

20 example, PNAs can be used as antisense or antigene agents for sequence-specific 
modulation of gene expression by, e.g. , inducing trascription or translation arrest or 
inhibiting replication. PNAs of FDRG can also be used, e.g. , in the analysis of single 
base pair mutations in a gene by, e.g., PNA directed PCR clamping; as 'artificial 
restriction enzymes when used in combination with other enzymes, e.g., SI nucleases 

25 (Hyrup B. ( 1 996) supra); or as probes or primers for DNA sequence and hybridization 
(Hyrup B et al. (1996) supra; Perry-O'Keefe supra). 

In another embodiment, PNAs of FDRG can be modified, e.g., to enhance their 
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DN A chimeras, or by the use of liposomes or other techniques of 

30 drug delivery known in the art. For example, PNA-DNA chimeras of FDRG can be 
generated which may combine the advantageous properties of PNA and DNA. Such 
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chimeras allow DNA recognition enzymes, e.g., RNAse H and DNA polymerases, to 
interact with the DNA portion while the PNA portion would provide high binding 
affinity and specificity. PNA-DNA chimeras can be linked using linkers of appropriate 
lengths selected in terms of base stacking, number of bonds between the nucleobases, 

5 and orientation (Hyrup B. (1996) supra). The synthesis of PNA-DNA chimeras can be 
performed as described in Hyrup B. (1996) supra and Finn PJ. et al (1996) Nucleic 
Acids Research 24 (1 7): 3357-63. For example, a DNA chain can be synthesized on a 
solid support using standard phosphoramidite coupling chemistry and modified 
nucleoside analogs, e.g., 5 , -(4-methoxytrityl)amino-5 , -deoxy-thymidine 

10 phosphoramidite, can be used as a between the PNA and the 5* end of DNA (Mag, U..et 
al. (1989) Nucleic Acid Res. 17: 5973-88). PNA monomers are then coupled in a 
stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3' DNA 
segment (Finn P.J. et al. (1 996) supra). Alternatively, chimeric molecules can be 
synthesized with a 5' DNA segment and a 3' PNA segment (Peterser, K.H. et al. (1 975) 

15 Bioorganic Med. Chem. Lett. 5:11 19-1 1 124). 

In other embodiments, the oligonucleotide may include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
transport across the cell membrane (see, e.g., Letsinger et al, 1989, Proc. Natl. Acad. 
Sci. U.S.A. 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 84:648-652; 

20 PCT Publication No. W088/09810, published December 15, 1988) or the blood-brain 
barrier (see, e.g., PCT Publication No. W089/10134, published April 25, 1988). In 
addition, oligonucleotides can be modified with hybridization-triggered cleavage agents 
(See, e.g., Krol e/ al, 1988, BioTechniques 6:958-976) or intercalating agents. (See, 
e.g., Zon, 1988, Pharm. Res. 5:539-549). To this end, the oligonucleotide may be 

25 conjugated to another molecule, e.g. , a peptide, hybridization triggered cross-linking 
agent, transport agent, hybridization^triggered cleavage agent, etc. 

II. Isolated FDRG Proteins and Anti-FDRG Antibodies 

One aspect of the invention pertains.to isolated FDRG proteins, and biologically 
30 active portions thereof, as well as polypeptide fragments suitable for use as immunogens 
to raise anti-FDRG antibodies. In one embodiment, native FDRG proteins can be 
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isolated from cells or tissue sources by an appropriate purification scheme using 
standard protein purification techniques. In another embodiment, FDRG proteins are 
produced by recombinant DfJA techniques. Alternative to recombinant expression, a 
FDRG protein or polypeptide can be synthesized chemically using standard peptide 

4 5 synthesis techniques. 

An "isolated" or "purified" protein or biologically active portion thereof is 
substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the FDRG protein is derived, or substantially free from 
chemical precursors or other chemicals when chemically synthesized. The language 

1 0 "substantially free of cellular material" includes preparations of FDRG protein in which 
the protein is separated from cellular components of the cells from which it is isolated or 
recombinantly produced. In one embodiment, the language "substantially free of 
cellular material" includes preparations of FDRG protein having less than about 30% (by 
dry weight) of non-FDRG protein (also referred to herein as a "contaminating protein"), 

15 more preferably less than about 20% of non-FDRG protein, still more preferably less 
than about 1 0% of non-FDRG protein, and most preferably less than about 5% non- 
FDRG protein. When the FDRG protein or biologically active portion thereof is 
recombinantly produced, it is also preferably substantially free of culture medium, i.e., 
culture medium represents less than about 20%, more preferably less than about 1 0%, 

20 and most preferably less than about 5% of the volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" 
includes preparations of FDRG protein in which the protein is separated from chemical 
precursors or other chemicals which are involved in the synthesis of the protein. . In one 
embodiment, the language "substantially free of chemical precursors or other chemicals" m 

25 includes preparations of FDRG protein having less than about 30% (by dry weight) of 
chemical precursors or non-FDRG chemicals, more preferably less than about 20% 
chemical precursors or non-FDRG chemicals, still more preferably less than about 1 0% 
chemical precursors or non-FDRG chemicals, and most preferably less than about 5% 
chemical precursors or non-FDRG chemicals. 
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Biologically active portions of a FDRG protein include peptides comprising 
amino acid sequences sufficiently, homologous to or derived from the amino acid 
sequence of the FDRG protein, e.g. , the amino acid sequence shown in SEQ ID NO:2, 
SEQ ID NO:5, SEQ ID NO: 12 or SEQ ID NO: 15, which include less amino acids than 

5 the full length FDRG proteins, and exhibit at least one activity of a FDRG protein. 
Typically , biologically active portions comprise a domain or motif with at least one 

s activity of the FDRG protein. A biologically active portion of a FDRG protein can be a 
polypeptide which is, for example, 10,25,50, 100 or more amino acids in length. 

In one embodiment, a biologically active portion of a FDRG protein comprises 

10 at least a C-terminal fibrinogen-like domain. In yet another embodiment, a biologically 
active portion of a FDRG protein comprises at least a signal sequence. In an alternative 
embodiment, a biologically active portion of a FDRG protein comprises a FDRG amino 
acid sequence lacking a signal sequence. /' : 

It is to be understood that a preferred biologically active portion of a FDRG 

15 protein of the present invention may contain at least one of the above-identified 

structural domains. A more preferred biologically active portion of a FDRG protein 
may contain at least two of the above-identified structural domains. An even more 
preferred biologically active portion of a FDRG protein may contain ai least three or 
more of the above-identified structural domains. , 

20 Moreover, other biologically active portions; in which other regions of the 

protein are deleted, can be prepared by recombinant techniques and evaluated for one or 
more of the functional activities of a native FDRG protein. 

In a preferred embodiment, the FDRG protein has an amino acid sequence shown 
in SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:12 or SEQ ID NO:15. In other 

25 embodiments, the FDRG protein is substantially homologous to SEQ ID NO:2, SEQ ID 
NO:5, SEQ IDNO:12 or SEQ IDNO:I5 and retains the functional activity of the 
protein of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:12 or SEQ ID NO:15 yet differs in 
amino acid sequence due to natural allelic variation or mutagenesis, as described in 
detail in subsection I above. Accordingly, in another embodiment, the FDRG protein is 

30 a protein which comprises an amino acid sequence at least about 60% homologous to the 
amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:12 or SEQ ID 
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NO:15 and retains the functional activity of the FDRG proteins of SEQ ID NO:2, SEQ 
ID NO:5, SEQ ID NO:12 or SEQ ID NO:15, respectively. Preferably, the protein is at 
least about 70% homologous to SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:I2 or SEQ 
ID NO:l 5, more preferably at least about 80% homologous to SEQ ID NO:2, SEQ ID m 

5 NO:5, SEQ ID NO:12 or SEQ ID NO:15, even more preferably at least about 90% 
homologous to SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:12 or SEQ ID NO:15, and 
most preferably at least about 95% or more homologous to SEQ ID NO:2, SEQ ID 
NO:5, SEQ ID NO: 12 or SEQ ID NO: 15. 

To determine the percent homology of two amino acid sequences or of two 

10 nucleic acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can 
be introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 
alignment with a second amino or nucleic acid sequence and non-homologous sequences 
can be disregarded for comparison purposes). In a preferred embodiment, the length of a 
reference sequence aligned for comparison purposes is at least 30%, preferably at least 

1 5 40%, more preferably at least 50%, even more preferably at least 60%, and even more 
preferably at least 70%, 80%, or 90% of the length of the reference sequence (e.g. , when 
aligning a second sequence to the FDRG amino acid sequence of SEQ ID NO:2, having 
86 amino acid residues, at least 66, preferably at least 46, more preferably at least 26 are 
aligned). The amino acid residues or nucleotides at corresponding amino acid positions 

20 or nucleotide positions are then compared. When a position in the first sequence is 
occupied by the same amino acid residue or nucleotide as the corresponding position in 
the second sequence, then the molecules are homologous at that position (/.*., as used 
herein amino acid or nucleic acid "homology" is equivalent to amino acid or nucleic acid 
"identity"). The percent homology between the two sequences is a function of the 

25 number of identical positions shared by the sequences (/.£?., % homology = # of identical 
positions/total # of positions x 100). 

The comparison of sequences and determination of percent homology between 
two sequences can be accomplished using a mathematical algorithim. A preferred, non- 
limiting example of a mathematical algorithim utilized for the comparison of sequences 

30 is the algorithm of Karlin and Altschul (1 990) Proc. Natl. Acad. Sci. USA 87:2264-68, 
modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-77. Such 
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an algorithm is incorporated 'into the NBLAST and XBLAST programs (version 2.0) of 
Altschul, et al. (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 1 00, wordlength = 12 to obtain 
nucleotide sequences homologous to FDRG nucleic acid molecules of the invention. 
BLAST protein searches can be performed with the XBLAST program, score = 50, 
wordlength = 3 to obtain amino acid sequences homologous to TAP-1 protein molecules 
of the invention. To obtain gapped alignments for comparison purposes, Gapped 
BLAST can be utilized as described in Altschul et al, (1997) Nucleic Acids Research 
25(1 7):33 89-3402. When utilizing BLAST and Gapped BLAST programs, the default 
parameters of the respective programs (eg., XBLAST and NBLAST) can be used. See 
http://www.ncbi:nlm.nih.gov. Another preferred, non-limiting example of a 
mathematical algorithim utilized for the comparison of sequences is the algorithm of 
Myers and Miller, CABIOS (1989). Such an algorithm is incorporated into the ALIGN 
program (version 2.0) which is part of the GCG sequence alignment software package. 
When utilizing the ALIGN program for comparing amino acid sequences, a PAM120 
weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be used. 
Alternatively, a PAM 250 residue Weight Table, a GAP penalty of 10, and a GAP length 
penalty of 1 0 can be used. Another preferred, non-limiting example of a mathematical 
algorithim utilized for the comparison of sequences is the algorithm of Wilbur-Lipmann 
which is part ofMegAlign™ sequence alignment software. When utilizing the Wilbur- 
Lipmann algorithm, a K-tuple of 1; a GAP penalty of 3, a window of 5, and diagonals 
saved set to = 5 can be used. Multiple alignment can be performed using the Clustal 
algorithm. 

The invention also provides FDRG chimeric or fusion proteins. As used herein, 
a FDRG "chimeric protein" or "fusion protein" comprises a FDRG polypeptide 
operatively linked to a non-FDRG polypeptide. A "FDRG polypeptide" refers to. a 
polypeptide having an amino acid sequence corresponding to FDRG, whereas a "non- 
FDRG polypeptide" refers to a polypeptide having an amino acid sequence 
corresponding to a protein which is not substantially homologous to the FDRG protein, 
e.g., a protein which is different from the FDRG protein and which is derived from the 
same or a different organism. Within a FDRG fusion protein the FDRG polypeptide 
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can correspond to all or a portion of a FDRG protein. In a preferred embodiment, a 
FDRG fusion protein comprises at least one biologically active portion of a FDRG 
protein In another preferred embodiment, a FDRG fusion protein comprises at least two 
biologically active portions of a FDRG protein. In another preferred embodiment, a 

5 FDRG fusion protein comprises at least three biologically active portions, of a FDRG 
protein. Within the fusion protein, the term "operatively linked" is intended to indicate 
that the FDRG polypeptide and the non-FDRG polypeptide are fused imframe to each . 
other. The non-FDRG polypeptide can be fused to the N-terminus or C-terminus of the 
FDRG polypeptide. . . '1 

10 For example, in one embodiment, the fusion protein is a GST-FDRG fusion 

protein in which'the FDRG sequences are fused to the C-terminus of the GST sequences. 
Such fusion proteins can facilitate the purification of recombinant FDRG. 

In anomer embodiment, me fusion protein is a FDRG protein containing a > 
heterologous signal sequence at its N-terminus. For example, the native FDRG signal 

15 sequence (i.e,aboiit amino acids 1 to 25 of SEQ ID NO:2) can be removed and replaced 
with a signal sequence from another protein. In certain host cells (e.g., mammalian host 
cells), expression and/qr secretion of FDRG can be increased through use of a 
heterologous signal sequence. . . 

In yet another embodiment, the fusion protein is a FDRG-immunoglobulin 

20 fusion protein in which the FDRG sequences comprising primarily the FDRG 
extracellular domain are fused to sequences derived from a member of die 
immunoglobulin protein family.. Soluble derivatives have also been made of cell surface 
glycoproteins in the immunoglobulin gene superfamily consisting of an. extracellular 
domain of the cell surface glycoprotein fused to an immunoglobulin constant (Fc) region 

25 (see e.g., Capon, DJ.etal. (1989) iVfl^e3J7:525-531 and Capon U.S. Patents 

5,1 16,964 and 5,428,130 [CD4-IgGl constructs]; Linsley, P.S. er al. (1991) J. Exp. Med. 
173:721-730 [a CD28-IgGl construct and a B7rl-IgGl construct]; and Linsley, P.S. et 
al. (1991) J. Exp. Med. 174:561-569 and U.S. Patent 5,434,131 [a CTLA4-IgGl])... Such 
fusion proteins have proven useful for modulating receptor-ligand interactions. Soluble 

30 derivatives of cell surface proteins of the tumor necrosis factor receptor (TNFR) 

superfamily proteins have been made consisting of an extracellular domain of the cell . 
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surface receptor fused to an immunoglobulin constant (Fc) region (see for example 
Moreland et al (1997) N. EngL J. Med. 337(3):141-147; van der Poll et al (1997) 
Blood 89(10):3727-3734; and Ammann et al (1997) J. Clin. Invest. 99(7): 1699-1 703.) 
Furthermore, fusion proteins have been made using the CH2 and CH3 domains of IgG 
5 fused downstream of murine IL-17 leader sequences and upstream of murine CTLA-8 
sequences and upstream of HVS13 sequences (see for example Yao et al (1995) 
Immunity 8:811-821.) \.J- * 

The FDRG-immunoglobulin fusion proteins of the invention can be incorporated 
into pharmaceutical compositions arid administered to a subject to inhibit ah interaction 

10 between a FDRG protein and a FDRG receptor on the surface of a cell, to thereby 

suppress FDRG-mediated cellular function in vivo. The FDRG-immunoglobulin fusion 
proteins can be used to affect the bioavailability of a FDRG protein. Inhibition of the 
FDRG protein/FDRG receptor interaction may be useful therapeutically, for example, in 
regulation of the cellular immune response, regulation of inflammation, regulation of 

1 5 hematopoiesis, regulation of adipogenesis, or regulation of angiogenesis. Moreover, the 
FDRG-immunoglobulin fusion proteins of the invention can be used as immunogens to 
produce anti-FDRG antibodies in a subject, to purify FDRG receptors and in screening 
assays to identify molecules which inhibit the interaction of a FDRG protein with a 
FDRG receptor. 

20 Preferably, a FDRG chimeric or fusion protein of the invention is produced by 

standard recombinant DN A techniques. For example, DNA fragments coding for the 
different polypeptide sequences are ligated together in-frame in accordance with 
conventional techniques^for example by employing blunt-ended or stagger-ended . 
termini for ligation; restriction enzyme digestion to provide for appropriate termini, 

25 filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid 
undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene 
can be synthesized by conventional techniques including automated DNA synthesizers. 
Alternatively, PCR amplification of gene fragments can be carried out using anchor 
primers which give rise to complementary overhangs between two consecutive gene 

30 fragments which can subsequently be annealed and reamplified to generate a chimeric 
gene sequence (see, for example, Current Protocols in Molecular Biology, eds. Ausubel 



WO 99/45135 PCT/US99/04548 

-39- 

et al John Wiley & Sons: 1992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide). A FDRG- 
encoding nucleic acid can be cloned into such an expression Vector such that the fusion 
moiety is linked in-frame to the FDRG protein. 
5 The present invention also pertains to variants of the FDRG proteins which 

function as either FDRG agonists (mimetics) or as FDRG antagonists. Variants of the 
FDRG protein can be generated by mutagenesis, e.g., discrete point mutation or 
truncation of the FDRG protein. An agonist of the FDRG protein can retain 
substantially the same, or a subset, of the biological activities of the naturally occurring 

1 0 form of the FDRG protein. An antagonist of the FDRG protein can inhibit one or more 
of the activities of the naturally occurring form of the FDRG protein by, for example, 
competitively binding to a FDRG receptor of FDRG-binding protein. Thus, specific 
biological effects can be elicited by treatment with a variant of limited function. In one 
embodiment, treatment of a subject with a variant having a subset of the biological 

1 5 activities of the naturally occurring form of the protein has fewer side effects in a subject 
relative to treatment with the naturally occurring form of the FDRG proteins. 

In one embodiment, variants of the FDRG protein which function as either 
FDRG agonists (mimetics) or as FDRG antagonists can be identified by- screening 
combinatorial libraries of mutants, (e.g., truncation mutants) of the FDRG protein for 

20 FDRG protein agonist or antagonist activity. In one embodiment, a variegated library of 
FDRG variants is generated by combinatorial mutagenesis at the nucleic acid level and 
is encoded by a variegated gene library. A variegated library of FDRG variants can be 
produced by, for example, enzymatically ligating a mixture of synthetic oligonucleotides 
into gene sequences such that a degenerate set of potential FDRG sequences is 

25 expressible as individual polypeptides, or alternatively, as a set of larger fusion proteins 
(e.g., for phage display) containing the set of FDRG sequences therein. There are a 
variety of methods which can be used to produce libraries of potential FDRG variants 
from a degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene 
sequence can be performed in an automatic DNA synthesizer, and the synthetic gene 

30 then ligated into an appropriate expression vector. Use of a degenerate set of genes 
allows for the provision, in one mixture, of all of the sequences encoding the desired set 
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of potential FDRG sequences. Methods for synthesizing degenerate oligonucleotides are 
known in the art (see, e.g.,.Narang, S A. (m3) Tetrahedron 39:3; Itakurae/ a/. (1984) 
Annu; Rev. Biochem. 53:323; Itakura et al (1984) Science 198:1056; Ike et al (1983) 
Nucleic Acid Res . 1 1:477 

5 In addition, libraries of fragments of the FDRG protein coding sequence can be 

used to generate a variegated population of FDRG fragments for screening and 
subsequent selection of variants of a FDRG protein. In one embodiment, a library of 
coding sequence fragments can be generated, by treating a double stranded PCR , 
fragment of a FDRG coding sequence with a nuclease under conditions wherein nicking 

1 0 occurs only about once per molecule, denaturing the double stranded DN A, renaturing 
the DNA to form double stranded DNA, which can include sense/antisense pairs from 
different nicked products, removing single stranded portions from reformed duplexes by 
treatment with SI nuclease, and ligating the resulting fragment library into an expression 
vector. By this method, an expression library can be derived which encodes N-terminal, 

1 5 C-terminal and internal fragments of various sizes of the FDRG protein. 

Several techniques are known in the art'for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. Such techniques are adaptable for 
rapid screening of the gene libraries generated by the combinatorial mutagenesis of 

20 FDRG proteins. The most widely used techniques, which are amenable to high through- 
put analysis, for screening large gene libraries typically include cloning the gene library 
into replicable expression vectors, transforming appropriate cells with the resulting 
library of vectors, and expressing the combinatorial genes under conditions in which 
detection of a desired activity facilitates isolation of the vector encoding the gene whose 

25 product was detected. Recrusive ensemble mutagenesis (REM), a hew technique which 
enhances the frequency of functional mutants in the libraries, can be used in 
combination with the screening assays to identify FDRG variants ( Arkin and Yourvan 
(1992) PNAS 59:781 1-7815; Delgrave et al (1993) Protein Engineering 6(3):327-331). 
In one embodiment, cell based assays can be exploited to analyze a variegated 

30 FDRG library. For example, a library of expression vectors can be transfected into a cell 
line which ordinarily secretes FDRG protein. Supernatants from the transfected cells are 
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then contacted with FDRG-responsive cells and the effect of the mutation in FDRG can 
be detected, e.g., by measuring any of a number of FDRG-responsive cell responses. 
Plasmid DNA can then be recovered from the mutant FDRG-secreting cells which score 
for inhibition, or alternatively, potentiation of the FDRG-dependent response, and the 
5 individual clones further characterized. 

♦ 

An isolated FDRG protein, or a portion or fragment thereof, can be used as an 
immunogen to generate antibodies that bind FDRG using standard techniques for 
polyclonal and monoclonal antibody preparation. The full-length FDRG protein can be 
used or, alternatively, the invention provides antigenic peptide fragments of FDRG for 

10 use as immunogens. The antigenic peptide of FDRG comprises at least 8 amino acid 
residues of the amino acid sequence shown in SEQ ID NO:2, SEQ ID NO:5, SEQ ID 
NO:12 or SEQ ID NO:15 and encompasses an epitope of FDRG such that an antibody 
raised against the peptide forms a specific immune complex with FDRG. Preferably, the 
antigenic peptide comprises at least 10 amino acid residues, more preferably at least 15 

15 amino acid residues, even more preferably at least 20 amino acid residues, and most 
preferably at least 30 amino acid residues. . 

A FDRG immunogen typically is used to prepare antibodies by immunizing a 
suitable subject, (e.g., rabbit, goat, mouse or other mammal) with the immunogen. An 
appropriate immunogenic preparation can contain, for example, recombinantly 

20 expressed FDRG protein or a chemically synthesized FDRG polypeptide. The 

preparation can further include an adjuvant, such as Freund's complete or incomplete 
adjuvant, or similar immunostimulatory agent. Immunization of a suitable subject with 
an immunogenic FDRG preparation induces a polyclonal anti-FDRG antibody response. 
Accordingly, another aspect of the invention pertains to anti-FDRG antibodies. 

25 The term "antibody" as used herein refers to immunoglobulin molecules and 

immunologically active portions of immunoglobulin molecules, /.e., molecules that 
contain an antigen binding site which specifically binds (immunoreacts with) an antigen, 
such as FDRG. Examples of immunologically active portions of immunoglobulin 
molecules include F(ab) and F(ab')2 fragments which can be generated by treating the 

30 antibody with an enzyme such as pepsin. The invention provides polyclonal and 
monoclonal antibodies that bind FDRG. The term "monoclonal antibody" or 
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"monoclonal antibody composition", as used herein, refers to a population of antibody 
molecules that contain only one species of an antigen binding site capable of 
immunoreacting with a particular epitope of FDRG. A monoclonal antibody 
composition thus typically displays a single binding affinity for a particular FDRG 
5 protein with which it immunoreacts. 

Polyclonal anti-FDRG antibodies can be prepared as described above by 
immunizing a suitable subject with a FDRG immunogen. The anti-FDRG antibody titer 
in the immunized subject can be monitored over time by standard techniques, such as 
with an enzyme linked immunosorbent assay (ELISA) using immobilized FDRG. If 

10 desired, the antibody molecules directed against FDRG can be isolated from the 

mammal (e.g., from the blood) and further purified by well known techniques, such as 
protein A chromatography to obtain the IgG fraction. At an appropriate time after 
immunization, e.g., when the anti-FDRG antibody titers are highest, antibody-producing 
cells can be obtained from the subject and used to prepare monoclonal antibodies by 

1 5 standard techniques, such as the hybridoma technique originally described by Kohler 
and Milstein (1975) Nature 256:495-497) (see also, Brown et al (1981) J. Immunol 
127:539-46; Brown et al (1980) J. Biol Chem ^255:4980-83; Yeh etal (1976) PNAS 
76:2927-31; and Yehetal (1982) Int. J. Cancer 29:269-75), the more recent human B 
cell hybridoma technique (Kozbor et al (1983) Immunol Today 4:72), the EBV- 

20 hybridoma technique (Cole et al (1985), Monoclonal Antibodies and Cancer Therapy, 
Alan R. Liss, Inc., pp. 77-96) or trioma techniques.. The technology for producing 
monoclonal antibody hybridomas is well known (see generally R. H. Kenneth, in 
Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum Publishing 
Corp., New York, New York (1980); E. A. Lerner (1981) Yale J. Biol Med, 

25 54:387-402; M L. Gefter et al (1977) Somatic Cell Genet. 3:231-36). Briefly, an 

immortal cell line (typically a myeloma) is fused to lymphocytes (typically splenocytes) 
from a mammal immunized with a FDRG immunogen as described above, and the 
culture supernatants of the resulting hybridoma cells are screened to identify a 
hybridoma producing a monoclonal antibody that binds FDRG. 

30 Any of the many well known protocols used for fusing lymphocytes and 

immortalized cell lines can be applied for the purpose of generating an anti-FDRG 
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monoclonal antibody (see, e.g., G. Galfte et al. (1977) Nature 266:55052; Gefter et al 
Somatic Cell Genet., cited supra; Lerner,7b/e J. Biol. Med, cited supra; Kenneth, 
Monoclonal Antibodies, cited supra). Moreover, the ordinarily skilled worker will 
appreciate that there are many variations of such methods which also would be useful. 

5 Typically, the immortal cell line (e.g. , a myeloma cell line) is derived from the same 
mammalian species as the lymphocytes. For example, murine hybridomas can be made 
by fusing lymphocytes from a mouse immunized with an immunogenic preparation of 
the present invention with an immortalized mouse cell line. ! Preferred immortal cell 
lines are mouse myeloma cell lines that are sensitive to culture medium containing 

1 0 hypoxanthine, aminopterin and thymidine ("HAT medium"). Any of a number of 
myeloma cell lines can be used as a fusion partner according to standard techniques, 
kg., the P3-NSl/lrAg4-l, P3-x63-Ag8.653 or Sp2/0-Agl4 myeloma lines. These 
myeloma lines are available from ATCC. Typically, HAT-sensitive mouse myeloma 
cells are fused to mouse splenocytes using polyethylene glycol ("PEG"). Hybridoma 

15 cells resulting from the fusion are then selected using HAT medium, which kills unfused 
and unproductively fused myeloma cells (unfused splenocytes die after several days 
because they are not transformed). Hybridoma cells producing a monoclonal antibody 
of the invention are detected by screening the hybridoma culture supernatants for 
antibodies that bind FDRG, e.g. , using a standard ELISA assay. 

20 Alternative to preparing monoclonal antibody-secreting hybridomas, a 

monoclonal anti-FDRG antibody can be identified and isolated by screening a 
recombinant combinatorial immunoglobulin library (e.g. , an antibody phage display 
library) with FDRG to thereby isolate immunoglobulin library members that bind 
FDRG. Kits for generating and screening phage display libraries are commercially 

25 available (e.g. , the Pharmacia Recombinant Phage Antibody System, Catalog No. 27- 
,9400-01; and the Stratagene SurfZAP™ Phage Display Kit, Catalog No. 240612). 
Additionally, examples of methods and reagents particularly amenable for use in 
generating and screening antibody display library can be found in, for example, Ladner 
et al U.S. Patent No. 5,223,409; Kang et al PCT International Publication No. WO 

30 92/1 861 9; Dower et al. PCT International Publication No. WO 91/1 7271 ; Winter et al 
PCT International Publication WO 92/20791; Markland et al PCT International 
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Publication No. WO 92/1 5679; Breitling et al. PCT International Publication WO . 
93/01288; McCafferty et al. PCT International Publication No. WO 92/01047; Garrard 
et al PCT International Publication No. WO 92/09690; Ladner et al. PCT International ' 
Publication No. WO 90/02809; Fuchs etal. (1991) Bio/Technology 9:1370-1372; Hay et 
5 al. (1992) Hum. Antibod. Hybridomas 3:81-85; Huse etal. (1989) Science 246:1275- 
1281; Griffiths et al. (\993)EMBOJ 12:725-734; Hawkins et al. (1992) J. Mol. Biol. 
226:889-896; Clarkson et al. (1991) Nature 352:624-628; Gram et al. (1992) PNAS. 
89:3576-3580; Garrad et al. (1991) Bio/Technology 9:1373-1377; Hoogenboom et al. 
(1991) Nuc. Acid Res. 19:4133-4137; Barbase/ al. (1991) PNAS 88:7978-7982; and 

10 McCafferty et al Nature (1990) 348:552-554. 

Additionally, recombinant anti-FDRG antibodies, such as chimeric and 
humanized monoclonal antibodies, comprising both human and non-human portions, • 
which can be made using standard recombinant DNA techniques, are within the scope of 
the invention. Such chimeric and humanized monoclonal antibodies can be produced by 

1 5 recombinant DNA techniques known in the art, for example using methods described in 
Robinson et al. International Application No. PCT/US86/02269; Akira, et al. European 
Patent Application 184,1 87; Taniguchi, M., European Patent Application 171,496; 
Morrison et al. European Patent Application 1 73,494; Neuberger et al. PCT 
International Publication No. WO 86/01533; Cabilly et al U.S. Patent No. 4,81 6,567; 

20 Cabilly et al. European Patent Application 125,023; Better et al. (1988) Science 
240:1041-1043; Uuet al (1987) PNAS 84:3439-3443; Liu et al. (1987) J. Immunol. 
139:3521-3526; Sun et al (1987) PNAS 84:214-218; Nishimura et al (1987) Cane. Res, 
47:999-1005; Wood et al (1985) Nature 3 14:446-449; and Shaw et al (1988) J. Natl, 
Cancer Inst. 80:1553-1559); Morrison, S. L. (1985) Science 229:1202-1207; Oi et al 

25 (1986) BioTechniques 4:214; Winter U.S. Patent 5,225,539; Jones et al. (1986) Nature 
321:552-525; Verhoeyan et al. (1988) Science 239:1534; and Beidler et al. (1988) J. 
Immunol. 141:4053-4060. 

An anti-FDRG antibody (e.g., monoclonal antibody) can be used to isolate 
FDRG by standard techniques, such as affinity chromatography or immunoprecipitation. 

30 An anti-FDRG antibody can facilitate the purification of natural FDRG from cells and of 
recombinant^ produced FDRG expressed in host cells. Moreover, an anti-FDRG 
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antibody can be used to detect FDRG protein (e.g. , in a cellular- lysate or cell 
supernatant) in order to evaluate the abundance and pattern of expression of the FDRG 
protein. Anti-FDRG antibodies can be used diagnostically to monitor protein levels in 
tissue as part of a clinical testing procedure, e.g. , to, for example, determine the efficacy 
5 f of a given treatment regimen. Detection can be facilitated by coupling (/.<?., physically 
linking) the antibody to a detectable substance. Examples of detectable substances 
. include various enzymes, prosthetic groups, fluorescent materials, luminescent 
materials, bioluminescent materials, and radioactive materials. Examples of suitable 
enzymes include horseradish peroxidase, alkaline phosphatase, p-galactosidase, or 

10 acetylcholinesterase; examples of suitable prosthetic group complexes include 

streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a < 
luminescent material includes luminol; examples of bioluminescent materials include * 

15 luciferase, luciferin, and aequorin, and examples of suitable radioactive material include 

,25 i, 131 i, 35 sor 3 H; .;' ; ^ ; 

III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression 
20 vectors, containing a nucleic acid encoding FDRG (or a portion thereof). As used 
herein, the term "vector" refers to a nucleic acid molecule capable of transporting 
another nucleic acid to which it has been linked. One type of vector is a "plasmid", 
which refers to a circular double stranded DNA loop into which additional DNA 
segments can be ligated. Another type of vector is a viral vector, wherein additional 
25' * DNA segments can be ligated into the viral genome. Certain vectors are capable of 
autonomous replication in a host cell into which they are introduced (e.g. , bacterial 
vectors having a bacterial origin of replication and episomal mammalian vectors). Other 
vectors (e.g., non-episbmal mammalian vectors) are integrated into the genome of a host 
cell upon introduction into the host cell, and thereby are replicated along with the host 
30 genome. Moreover, certain vectors are capable of directing the expression of genes to 
which they are bperatively linked. Such vectors are referred to herein as "expression 
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vectors". In general, expression vectors of utility in recombinant DNA techniques are 
often in the form of plasmids. In the present specification, "plasmid" and "vector" can 
be used interchangeably as the plasmid is the most commonly used form of vector. 
However, the invention is intended to include such other forms of expression vectors, 

5 such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno- . 
associated viruses), which serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of 
, the invention in a form suitable for expression of the nucleic acid in a host cell, which 
means that the recombinant expression vectors include one or more regulatory 

10 sequences, selected on the basis of the host cells to be used for expression, which is 
operatively linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean that the nucleotide sequence of 
interest is linked to the regulatory sequence(s) in a manner which allows for expression 
of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a 

1 5 host cell when the vector is introduced into the host cell). The term "regulatory 
sequence" is intended to includes promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel; Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, CA (1990). Regulatory sequences include those which 

20 direct constitutive expression of a nucleotide sequence in many types of host cell and 
those which direct expression of the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of the 
host cell to be transformed, die level of expression of protein desired, etc. The 

25 expression vectors of the invention can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., FDRG proteins, mutant forms of FDRG, fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed for 
expression of FDRG in prokaryotic or eukaryotic cells. For example, FDRG can be 

30 ■ expressed in bacterial cells such as E coli, insect cells (using baculovirus expression 
vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in 
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• Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, 
San Diego, CA (1990). Alternatively, the recombinant expression vector can be 
transcribed and translated in vitro, for example using T7 promoter regulatory sequences 
and T7 polymerase. 

5 Expression of proteins in prokaryotes is most often carried out in E. coli with 

vectors containing constitutive or inducible promotors directing the expression of either 
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase expression of recombinant protein; 

1 0 2) to increase the solubility of the recombinant protein; and 3) to aid in the purification 
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein from 
the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and 

15 their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D.B. 
and Johnson, K.S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) 
and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), 
maltose E binding protein, or protein A, respectively, to the target recombinant protein. 

20 "'■ Purified fusion proteins can be utilized in FDRG activity assays, in FDRG ligand 

binding (e.g., direct assays or competitive assays described in detail below), to generate 
antibodies specific for FDRG proteins, as examples. In a preferred embodiment, a 
FDRG fusion expressed in a retroviral expression vector of the present invention can be 
utilized to infect bone marrow cells which are subsequently transplanted into irradiated 

25 recipients. The pathology of the subject recipient is then examined after sufficient time 
has passed (e.g. , six (6) weeks). 

Examples of suitable inducible non-fusion E. coli expression vectors include 
pTrc (Amann et al, (1988) Gene 69:301-3 15) and pET 1 Id (Studier et of., Gene 
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, 

30 California (1990) 60-89). Target gene expression from the pTrc vector relies on host 
RNA polymerase transcription from a hybrid trp-lac fusion promoter. Target gene 
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expression from the pET 1 Id vector relies on transcription from a f 7 gnl 0-lac fusion . 
promoter mediated by a coexpressed viral RN A polymerase (T7gnl). This viral 
polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a resident X 
prophage harboring a T7 gnl gene under the transcriptional control of the lacUV 5 
promoter. 

One strategy to maximize recombinant protein expression in K coli is to express 
the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, S., Gene Expression Technology: Methods in 
EnzymologyWS, Academic Press, San Diego, California (1990) 119-128). Another 
strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an 
expression vector so that the individual codons for each amino acid are those 
' preferentially utilized in E. coli (Wada etal, (1992) Nucleic Acids Res. 20:21 1 1.-2118). 
Such alteration of nucleic acid sequences of the invention can be carried out by standard 
DNA synthesis techniques. : ■ 

In another embodiment, the FDRG expression vector is a yeast expression 
vector. Examples of vectors for expression in yeast S. cerivisae include pYepSecl 
(Baldari, et al, (1987) EMBOJ. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 
30:933-943), pJRY88 (Schultz et al, (1987) Gene 54:1 13-123), pYES2 (Invitrogen 
Corporation, San Diego, CA), and picZ (InVitrogen Corp, Carlsbad, CA). 

Alternatively, FDRG can be expressed in insect cells using baculovirus 
expression vectors. Baculovirus vectors available for expression of proteins in cultured 
insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al. (1983) Mol Cell Biol. 
3:2156-2165) and the pVL series (Lucklow and Summers (1989) Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed, B. (1987) Nature 329:840) and pMT2PC 
(Kaufman et al. (1987) EMBOJ. 6:187-195). When used in mammalian cells, the 
expression vector's control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook, J., Fritsh, E. F., 
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and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd, ed. f Cold Spring 
Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989. " ; 

' In another embodiment, the recombinant mammalian expression vector is 
5 capable of directing expression of the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the ; albumin promoter (liver-specific;.Pinkert et al. 
(1987) Genes Dev: 1 :268-277), lymphoid-specific promoters (Calame and Eaton (1988) 

10 Adv. Immunol 43:235-275), in particular promoters of T cell receptors; (Winoto and 
Baltimore (1989) EMBOJ. 8:729-733) and immunoglobulins (Banerji et al (1983) Cell 
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters 
(e.g. , the neurofilament promoter; Byrne and Ruddle (1 989) PNAS 86:5473-5477), 
pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916), and mammary 

1 5 gland-specific promoters (e.g. ; milk whey promoter; U.S. Patent No. 4 ? 873,3 1 6 and 
European Application Publication No. 264,166). Developmentally-regulated promoters 
are also encompassed, for example the murine hox promoters (Kessel and Gruss (1990) 
Science 249:374-379) and the a-fetoprotein promoter (Campes and tilghman (1989) 
Genes Dev. 3:537-546). 

20 The invention further provides a recombinant expression vector comprising a 

DN A molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in 
a manner which allows for expression (by transcription of the DNA molecule) of an 
RNA molecule which is antisense to FDRG mRNA. Regulatory sequences operatively 

25 linked to a nucleic acid cloned in the antisense orientation can be chosen which direct 
the continuous expression of the antisense RNA molecule in a variety of cell types, for 
instance viral promoters and/or enhancers, or regulatory sequences can be chosen which 
direct constitutive, tissue specific or cell type specific expression of antisense RNA. The 
antisense expression vector can be in the form of a recombinant plasmid, phagemid or 

30 attenuated virus in which antisense nucleic acids are produced under the control of a 
high efficiency regulatory region, the activity of which can be determined by the cell 
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type into which the.vector is introduced.. For a discussion of the regulation of gene 
expression using antisense genes see Weintraub, H. et al. , Antisense RNA as a 
molecular tool for genetic analysis, Reviews - Trends in Genetics,Vol 1(1) 1986. 

Another aspect of the invention pertains to host cells into which a recombinant 

5 "expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such 
terms refer not only to the particular subject cell but to the progeny or potential progeny 
of such a cell. Because certain modifications may occur in succeeding generations due 
to either mutation or environmental influences, such progeny may not, in fact, be .'• 

1 0 identical to the parent cell, but are still included within the scope of the term as used 

herein. ' ' 

A host cell can be any prokaryotic or eukaryotic cell. For example, FDRG - 
protein can be expressed in bacterial cells such as E. coli, insect cells, yeast or 
mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other 

15 suitable host cells are known to those skilled in the art. . 

Vector DN A can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid (e.g. , DNA) into a host cell, including 

20 calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 
transfection, lipofection, or electroporation. Suitable methods for transforming or 
transfecting host cells can be found in Sambrook, etal. {Molecular Cloning: A 
Laboratory Manual. 2nd,ed., Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989), and other laboratory manuals. 

25 For.stable transfection of mammalian cells, it is known that, depending upon the 

expression vector and transfection technique, used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Preferred 

30 selectable markers include those which confer resistance to drugs, such as G418, 
hygromycin and methotrexate. Nucleic acid encoding a selectable marker can be 
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introduced into a host cell on the same vector as that encoding FDRG or can be 
introduced on a separate vector. Cells stably transfected with the introduced nucleic 
acid can be identified by drug selection (e.g. , cells that have incorporated the selectable 
marker gene will survive, while the other cells die). 
5 A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 

culture, can be used to produce (i.e., express) FDRG protein. Accordingly, the invention 
further provides methods for producing FDRG protein using the host cells of the 
invention. In one embodiment, the method comprises culturing the host cell of 
invention (into which a recombinant expression vector encoding FDRG has been 
1 0 introduced) in a suitable medium such that FDRG protein is produced. In another 

embodiment, the method further comprises isolating FDRG from the medium or the host 
cell. 

The host cells of the invention can also be used to produce nonhuman transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized 

15 oocyte or an embryonic stem cell into which FDRG-coding sequences have been 

introduced. Such host cells can then be used to create non-human transgenic animals in 
which exogenous FDRG sequences have been introduced into their genome or 
homologous recombinant animals in which endogenous FDRG sequences have been 
altered. Such animals are useful for studying the function and/or activity of FDRG and 

20 for identifying and/or evaluating modulators of FDRG activity. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a 
rodent such as a rat or mouse, in which one or more of the cells of the animal includes a 
transgene. Other examples of transgenic animals include non-human primates, sheep, 
dogs, cows, goats, chickens, amphibians, etc.* A transgene is exogenous DNA which is 
- 25 integrated into the genome of a cell from which a transgenic animal develops and which 
remains in the genome of the mature animal, thereby directing the expression of an 
; encoded gene product in one or more cell types or tissues of the transgenic animal. As 
used herein, a "homologous recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous FDRG gene has been 

30 altered by homologous recombination between the endogenous gene and an exogenous 
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DNA molecule introduced into a cell of the animal, e.g. , an embryonic cell of the 
animal, prior to development of the animal. ♦ 

A transgenic animal of the invention can be created by introducing FDRG- 
encoding nucleic acid into the male pronuclei of a fertilized oocyte, e.g., by 
microinjection, retroviral infection, and allowing the oocyte to develop in a 
pseudopregnant female foster animal. The FDRGcDNA sequences of SEQ ID NO: 1, 
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 14, or the 
DNA insert of the plasmid deposited with ATCC as Accession Number 98632 can be 
introduced as a transgene into the genome of a non-human animal. Intronic sequences 
and polyadenylation signals can also be included in the transgene to increase the 
efficiency of expression of the transgene: A tissue-specific regulatory sequence(s) can 
be operably linked to the FDRG transgene to direct expression of FDRG protein to 
particular cells. Methods for generating transgenic animals via embryo manipulation 
and microinjection, particularly animals such as mice, have become conventional in the 
art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, both 
by Leder et al, U.S. Patent No. 4,873,191 by Wagner et al. and in Hogan, B., 
Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring 
. Harbor, N.Y., 1986). Similar methods are used for production of other transgenic 
animals. A transgenic founder animal can be identified based upon the presence of the 
FDRG transgene in its genome and/or expression of FDRG mRNA in tissues or cells of 
the animals. A transgenic founder animal can then be used to breed additional animals 
carrying the transgene: Moreover, transgenic animals carrying a transgene encoding 
FDRG can further be bred to other transgenic animals carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains 
at least a portion of a FDRG gene into which a deletion, addition or substitution has 
been introduced to thereby alter, e.g. , functionally disrupt, the FDRG gene. The FDRG 
gene can be ahuman gene (e.g., the cDNA of SEQ ID NO:l, SEQ ID NO:3 or SEQ ID 
NO:4), but more preferably, is a non-human homologue of a human FDRG gene. For 
example, amouse FDRG gene (e.g., the cDNA of SEQIDNO:ll, SEQIDNO:13or 
SEQ ID NO:14) can be used to construct a homologous recombination vector suitable 
for altering an endogenous FDRG gene in the mouse genome. In a preferred 



WO 99/45135 PCT/US99/04548 

-53- 

embodiment, the vector is designed such that, upon homologous recombination, the 
endogenous FDRG gene is functionally disrupted {i.e., no longer encodes a functional 
protein; also referred to as a "knock out" vector). Alternatively, the vector can be 
designed such that, upon homologous recombination, the endogenous FDRG gene is 
5 mutated or otherwise altered but still encodes functional protein (e.g., the upstream 
regulatory region can be altered to thereby alter the expression of the endogenous FDRG 
protein). In the homologous recombination vector, the altered portion of the FDRG 
gene is flanked at its 5' and 3' ends by additional nucleic acid of the FDRG gene to allow 
for homologous recombination to occur between the exogenous FDRG gene carried by 

1 0 the vector and an endogenous FDRG gene in an embryonic stem cell. The additional 
flanking FDRG nucleic acid is of sufficient length for successful homologous 
recombination with the endogenous gene. Typically, several kilobases of flanking DNA 
(both at the 5' and 3' ends) are included in the vector (see e.g., Thomas, K.R. and 
Capecchi, M. R. (1987) Cell 51:503 for a description of homologous recombination 

15 vectors). The vector is introduced into an embryonic stem cell line (e.g., by 
electroporation) and cells in which the introduced FDRG gene has homologously 
recombined with the endogenous FDRG gene are selected (see e.g., Li, E. et al (1992) 
Cell 69:915). The selected cells are then injected into a blastocyst of an animal (e.g., a 
mouse) to form aggregation chimeras (see e.g., Bradley, A. in Teratocarcinomas and 

20 Embryonic Stem Cells: A Practical Approach, E.J. Robertson, ed. (IRL, Oxford, 1987) 
pp. 1 1 3- 1 52). A chimeric embryo can then be implanted into a suitable pseudopregnant 
female foster animal and the embryo brought to term. Progeny harboring the 
homologously recombined DNA in their germ cells can be used to breed animals in 
which all cells of the animal contain the homologously recombined DNA by germline 

25 transmission of the transgene. Methods for constructing homologous recombination 
vectors and homologous recombinant animals are described further in Bradley, A. 
(1991) Current Opinion in Biotechnology 2:823-829 and in PCT International 
Publication Nos.: WO 90/1 1 354 by Le Mouellec et ai; WO 91/01 140 by Smithies et 
al.; WO 92/0968 by Zijlstra et aL; and WO 93/04169 by Berns et al. 

30 In another embodiment, transgenic non-humans animals can be produced which 

' contain selected systems which allow for regulated expression of the transgene. One 
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. example of such a system is the cre/loxP. recombinase system of bacteriophage PI . For 
a description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) PNAS 
89:6232-6236. Another example of a recombinase system is the FLP recombinase 
system of Saccharomyces cerevisiae (O'Gorman et al. (\99\) Science 251:1351-1355. If 

5 a cre/loxP recombinase system is used to regulate expression of the transgene, animals 
containing transgenes encoding both the Cre recombinase and a selected protein are 
required. Such animals can be provided through the construction of "double" transgenic 
animals, e.g. , by mating two transgenic animals, one containing a transgene encoding a 
selected protein and the other containing a transgene encoding a recombinase. 

1 0 Clones of the non-human transgenic animals described herein can also be 

produced according to the methods described in Wilmut, I. et al. (1997) Nature 385:810- 
813 and PCT Publication Numbers WO 97/07668 and WO 97/07669. In brief, a cell, 
e.g., a somatic cell, from the transgenic animal can be isolated and induced to exit the 
growth cycle and enter G 0 phase. The quiescent cell can then be fused, e.g., through the 

15 use of electrical pulses, to an enucleated oocyte from an animal of the same species from 
which the quiescent cell is isolated. The recontructed oocyte is then cultured such that it 
develops to morula or blastocyte and then transferred to pseudopregnant female foster 
animal. The offspring borne of this female foster animal will be a clone of the animal 
from which the cell, e.g., the somatic cell, is isolated. 

20 

IV. Pharmaceutical Compositions 

The FDRG nucleic acid molecules, FDRG proteins, and anti-FDRG antibodies 
(also referred to herein as "active compounds") of the invention can be incorporated into 
pharmaceutical compositions suitable for administration. Such compositions typically 

25 comprise the nucleic acid molecule, protein, or antibody and a pharmaceutically 

acceptable carrier. As used herein the language "pharmaceutically acceptable carrier" is 
intended to include any and all solvents, dispersion media, coatings, antibacterial and 
antifungal agents, isotonic and absorption delaying agents, and the like, compatible with 
pharmaceutical administration. The use of such media and agents for pharmaceutically 

30 active substances is well known in the art. Except insofar as any conventional media or 
agent is incompatible with the active compound, use thereof in the compositions is 
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contemplated. Supplementary active compounds can also be incorporated into the 
compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration. Examples of routes of administration include 
5 parenteral, e.g. , intravenous, intradermal, subcutaneous, oral (e.g. , inhalation), 

transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions 
used for parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 

1 0 antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, 
such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 

15 enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion. For 

20 intravenous administration, suitable carriers include physiological saline, bacteriostatic 
water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline (PBS). In 
all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action' of microorganisms such as 

25 bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid 
polyethey lene glycol, and the like), and suitable mixtures thereof. The proper fluidity 
can be maintained, for example, by the use of a coating such as lecithin, by the . 
maintenance of the required particle size in the case of dispersion and by the use of 

30 surfactants. Prevention of the action of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 
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ascorbic acid, thimerosal, and the like. In many cases, it will .be preferable to include 
. isotonic agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium 
chloride in the composition. Prolonged absorption of the injectable compositions can be 
brought about by including in the composition an agent which delays absorption, for 

5 example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
compound (e.g. , a FDRG protein or anti-FDRG antibody) in the required amount in an 
appropriate solvent with one or a combination of ingredients enumerated above, as 
required, followed by filtered sterilization. Generally, dispersions are prepared by 

1 0 incorporating the active compound into a sterile vehicle which contains a basic 

dispersion medium and the required other ingredients from those enumerated above. In 

the case of sterile powders for the preparation of sterile injectable solutions, the 

preferred methods of preparation are vacuum drying and freeze-drying which yields a 

powder of the active ingredient plus any additional desired ingredient from a previously 

15 sterile-filtered solution thereof. , 

Oral compositions generally include an inert diluent or an edible carrier. They 

can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 

20 using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. Pharmaceutically 
compatible binding agents, and/or adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules* troches and the like can contain any of the 
following ingredients, or compounds of a similar nature: a binder such as 

25 microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel™, or corn starch; a 
lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon 
dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring agent such as 
peppermint, methyl salicylate, or orange flavoring. 



WO 99/45135 PCI7US99/04548 

-57- 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable propellant, 
e.g. , a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
5 transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
1 0 compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art. 

The compounds can also be prepared in the form of suppositories (e.g. , with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

1 5 In one embodiment, the active compounds are prepared with carriers that will 

protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 

20 Methods for preparation of such formulations will be apparent to those skilled in the art. 
The materials can also be obtained commercially from Alza Corporation and Nova . 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutical^ 
acceptable carriers. These can be prepared according to methods known to those skilled 

25 in the art, for example,, as described in U.S. Patent No. 4,522,8 11. 

It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 

30 calculated to produce the desired therapeutic effect in association with the required 

r 

pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
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dictated by and directly dependent on the unique characteristics of the active compound 
and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 

Toxicity and therapeutic efficacy of such compounds can be determined by 
5 standard pharmaceutical procedures in cell cultures or experimental animals, eg, for 
determining the LD50 (the dose lethal to 50% of the population) and the EDS 0 (the dose 
therapeutically effective in 50% of the population). The^dose ratio between toxic and 
therapeutic effects is the therapeutic index and it can be 

LD50/ED50. Compounds which exhibit large therapeutic indices are preferred. While 

10 compounds that exhibit toxic side effects may be used, care should be taken to design a 
delivery system that targets such compounds to the site of affected tissue in order to 
minimize potential damage to uninfected cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used in 
formulating a range of dosage for use in humans. The dosage of such compounds lies 

15 preferably within a range of circulating concentrations that include, the ED50 with little 
or no toxicity. The dosage may vary within this range depending upon the dosage form 
employed and the route of administration utilized. For any compound used in the 
method of the invention, the therapeutically, effective dose can be estimated initially 
from cell culture assays A dose may be formulated in animal models to achieve a 

20 ^ circulating plasma concentration range that includes. the IC50 {i.e. , the concentration of 
the test compound which achieves a half-maximal inhibition of symptoms) as 
determined in cell culture. Such information can be used to more accurately determine 
useful doses in humans. Levels in plasma may be measured, for example, by high 
performance liquid chromatography. 

25 The nucleic acid molecules of the invention can be inserted into vectors and used 

as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (see U.S. Patent 5,328,470) or by 
stereotactic injection (see e.g., Chen et al (1 994) PJ^S 91:3054-3057). The, . 
pharmaceutical preparation of the gene therapy vector can include the gene therapy 

30 vector in an acceptable diluent, or can cbmprise a slow release matrix in which the gene 
delivery vehicle is imbedded. Alternatively, where the complete gene delivery vector 
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: can be produced intact from recombinant cells, e.g. , retroviral vectors, the 

pharmaceutical preparation can include one or more cells which produce the gene 
delivery system. 

The pharmaceutical compositions can be included in a container, pack, or 
5' dispenser together with instructions for administration. 

V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein homologues, and antibodies 
described herein can be used in one or more of the following methods: a) screening 

1 0 assays; b) detection assays {e.g. , chromosomal mapping, tissue typing, forensic biology), 
c) predictive medicine (e.g. , diagnostic assays, prognostic assays, monitoring clinical > 
triais); and d) methods of treatment (e.g., therapeutic and prophylactic methods as well 
as such methods in the context of pharmacogenetics). As described herein, a FDRG 
protein of the invention has one or more of the following activities: (i) activation of a 

15 FDRG-dependent signal transduction pathway; (ii) modulation of angiogenesis; (iii) 
modulation of a hematopoiesis; (iv) modulation of the development or differentiation of 
a FDRG-expressing cell (e.g., mediation of growth and/or differentiation of adipocytes, 
for example, white adipocytes or brown adipocytes); (v) modulation of the development 
or.differentiation of a non- FDRG-expressing cell; (yi) modulation of the homeostasis of 

20 a FDRG-expressing cell; (vii) modulation of insulin sensitivity and/or insulin 
responsiveness; (viii) modulation of insulin secretion (ix) modulation of cell 
recruitment; and (x) modulation of the homeostasis of a non-FDRG-expressing cell. 
Accordingly, a FDRG protein, FDRG modulator, or FDRG receptor modulator can be 
used in, for example, ( 1 ) activation of a FDRG-dependent signal transduction pathway; 

25 '(2) modulation of angiogenesis; (3) modulation of a hematopoiesis; (4) modulation of 
the development or differentiation of a FDRG-expressing cell (e.g., mediation of growth 
and/or differentiation of adipocytes, for example, white adipocytes or brown 
adipocytes); (5) modulation of the development or differentiation of a non- FDRG- 
expressing cell; (6) modulation of the homeostasis of a FDRG-expressing cell; (7) , 

30 modulation of insulin sensitivity and/or insulin responsiveness; (8) modulation of insulin 
secretion (9) modulation of cell recruitment'; ( 1 0) modulation of the homeostasis of a 
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non-FDRG-expressing cell; (1 1) maintenance of energy homeostasis (e.g., regulating the 
balance and/or imbalance between energy storage and energy expenditure, for example, . 
increasing/decreasing energy expenditure); (12) regulation of adaptive thermpgenesis 
.'(e.g., regulation of the biogenesis of mitochondria, regulation of the expression of ; 

5 mitocrondrial enzymes, regulation of expression ^uncoupling proteins; (13) regulation ■.; 
of adiposity; (14) modulation of the efficiency of energy storage; (1 5) regulation of ' 
appetite; and (16) regulation of blood vessel formation. ; 

The isolated nucleic acid molecules of the invention can be used, for example, to 
express FDRG protein (e.g., via a recombinant expression .vector in a host cell in gene 

10 therapy applications), to detect FDRG mRNA (e.g., in a biological sample) or a genetic 
alteration in a FDRG gene, and to modulate FDRG activity, as described further below. 
In addition, the FDRG proteins can be used to screen drugs or compounds which 
modulate the FDRG activity as well as to treat disorders characterized by insufficient or 
excessive production of FDRG protein or production of FDRG protein forms which ; 

15 have decreased or aberrant activity compared to FDRG wild type protein (e.g.,; ,. 

differentiative or developmental disorders, proliferative disorders or cell recruitment , 
disorders). Moreover, soluble forms of the FDRG protein can be usedtobind 
membrane-bound FDRG receptors and influence bioavailability of such a receptors 
- cognate ligand. In addition, the anti-FDRG antibodies of me invention can be used to 

20 detect and isolate FDRG proteins and modulate FDRG activity.. ; 

A. Screening Assays ; v \ v '.' ; 

• > , • ' The invention provides a method (also referredto herein as a "screening assay"! 
for identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides,' . 
25 peptidomimetics, small molecules or other drugs) which bind to FDRG proteins.or have 
a stimulatory or inhibitory effect on, for example, FDRG expression or FDRG activity. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity.of a FDRG protein or polypeptide or 
biologically active portion thereof. In another embodiment, the invention provides 
30 assays for screening candidate or test compounds which bind to or modulate the activity 
ofa FDRG receptor. If 
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The test compounds of the present invention can be obtained using any of the 
numerous approaches in combinatorial library methods known in the art, including: 
biological libraries; spatially addressable parallel solid phase or solution phase libraries; 
synthetic library methods requiring deconvolution; the 'one-bead one-compound 1 library 
5 method; and synthetic library methods using affinity chromatography selection. The 
biological library approach is limited to peptide libraries, while the other four 
approaches are applicable to peptide, non-peptide oligomer or small molecule libraries 
of compounds (Lam, K.S. (1997) Anticancer Drug Des. 12:145). 

Examples of methods for the synthesis of molecular libraries can be found in the 

10 art, for example in: DeWitt etal (1993) Proc. Natl. Acad. Set U.S.A. 90:6909; Erb et 
ah (1994) Proc, Natl Acad. ScL USA 91:1 1422; Zuckermann etal (1994). J. Med. . 
Chem. 37:2678; Cho et al. (1993) Science 261:1303; Carrell et al (1994) Angew. Chem. 
Int. Ed. Engl. 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and in 
Galloper al. (1994) J. Med Chem. 37:1233. 

1 5 Libraries of compounds may be presented in solution (e.g. , Houghten ( 1 992) 

Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor 
(1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP 
•409), plasniids (CuWet al. (1992) Proc Natl AcadSci USA &9:m5-m9) ox on phage 
((Scott and Smith (1990) Science 249:386-390); (Devlin (1990) Science 249:404-406); 

20 (CwirlaeM/. (1990) Proc. Natl. Acad. Sci. 87:6378-6382); (FelicL(1991) J. Mol Biol. 
222:301-310); (Ladner supra.)). 

In one embodiment, an assay is a cell-based assay in which a cell which 
expresses FDRG is contacted with a test compound and the ability of the test compound 
to modulate FDRG expression and/or activity determined. Determining the ability of 

25 the compound to modulate FDRG expression can be accomplished, for example, by 
detecting the presence or absence or amount of a FDRG transcript of protein (e.g., using 
a probe based on the nucleotide sequences of the present invention or an anti-FDRG 
antibody). Determining the ability of the compound to modulate FDRG activity can be 
accomplished, for example, by detecting FDRG activity in cellular supernatants (e.g., 

30 contacting a second cell with the supernatants). 
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In another embodiment, an assay is a cell-based assay in which a cell which 
expresses a FDRG receptor on the cell surface is contacted with a test compound and the 
ability of the test compound to bind to a FDRG receptor determined. The cell, for 
example, can be of mammalian origin or a yeast cell. Determining the ability of the test 
compound to bind to a FDRG receptor, can be accomplished, for example, by coupling 
the test compound with a radioisotope or enzymatic label such that binding of the test 
compound to the FDRG receptor can be determined by detecting the labeled compound 
in a complex. For example, test compounds can be labeled with 12 5 I, 3 5 S, 14 C, or 
3 H, either directly or indirectly, and the radioisotope detected by direct counting of 
radioemmission or by scintillation counting. Alternatively, test, compounds can be 
enzymatically labeled with, for example, horseradish peroxidase, alkaline phosphatase, 
or luciferase, and the enzymatic label detected by determination of conversion of an 
appropriate substrate to product. 

It is also within the scope of this invention to determine the ability of a test 
compound to interact with a FDRG receptor without the labeling of any of the 
interactants. For example, a microphysiometer can be used to detect the interaction of a 
test compound with a FDRG receptor without the labeling of either the test compound or 
the receptor. McConnell. H. U.etal (1992) Science 257:1906-1912: As used herein, a 
"microphysiometer" (e.g., Cytosensor™) is an analytical instrument that measures the 
rate at which a cell acidifies its environment using a light-addressable potentiometric 
sensor (LAPS). Changes in this acidification rate can be used as an indicator of the 
interaction between ligand and receptor. ( ■• 

In a preferred embodiment, the assay comprises contacting a cell which 
expresses a FDRG receptor on the cell surface with a FDRG protein or biologically- 
active portion thereof, to form an assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test compound to interact with a FDRG 
receptor, wherein determining the ability of the test compound to interact with a FDRG 
receptor comprises determining the ability of the test compound to preferentially bind to 
the FDRG receptor as compared to the ability of FDRG, or a biologically active portion 
thereof, to bind to the receptor. Alternatively, the assay can comprise determining the 
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ability of the test compound to modulate a cellular activity of FDRG and/or an FDRG 
receptor. 

Determining the ability of the FDRG protein to bind to or interact with a FDRG 
target molecule, or to modulate a FDRG activity, can be accomplished by one of the 
5 methods described above for determining direct binding. In a preferred embodiment, 
determining the ability of the FDRG protein to bind to or interact with a FDRG target 
molecule can be accomplished by determining the activity of the target molecule. For 
example, the activity of the target molecule can be determined by detecting induction of 
a cellular second messenger of the target (Le. intracellular Ca 2+ , diacylgly.cerol, IP 3 , 

1 0 etc.), detecting catalytic/enzymatic activity of the target an appropriate substrate, . 
detecting the induction of a reporter gene (comprising a FDRG-responsive regulatory 
element operatively linked to a nucleic acid encoding a detectable marker, e.g., 
luciferase), or detecting a cellular response, for example, expression of a differentiated 
phenotype. ' 

1 5 In yet another embodiment, an assay of the present invention is a cell-free assay 

in which a FDRG protein or biologically active portion thereof is contacted with a test 
compound and the ability of the test compound to bind to the FDRG protein or 
biologically active portion thereof is determined. Binding of the test compound to the 
FDRG protein can be determined either directly or indirectly as described above. In a 

20 preferred embodiment, the assay includes contacting the FDRG protein or biologically 
active portion thereof with a known compound which binds FDRG to form an assay 
mixture, contacting the assay mixture with a test compound, and determining the ability 
of the test compound to interact with a FDRG protein, wherein determining the ability 
of the test compound to interact with a FDRG protein comprises determining the ability 

25 of the test compound to preferentially bind to FDRG or biologically active portion 
thereof as compared to the known compound. 

In another embodiment, the assay is a cell-free assay in which FDRG protein or 
biologically active portion thereof is contacted with a test compound and the ability of 
the test compound to modulate (e.g. , stimulate or inhibit) the activity of the FDRG - 

30 , protein or biologically active portion thereof is determined. Determining the ability of 
the test compound to modulate the activity of FDRG can be accomplished, for example, 
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by determining the ability of the FDRG protein to bind to a FDRG target molecule (e.g., 
a FDRG binding protein) by one of the methods described above for determining direct 
binding. Determining the ability of the FDRG protein to bind to a FDRG target 
molecule can also be accomplished using a technology such as real-time Biomolocular 
5 Interaction Analysis (BIA). Sjolander, S. and Urbaniczky, C. (1991) Anal. Chem. 
63:2338-2345 and Szabo et al (1995) Curr. Opin. Struct. Biol. 5:699-705. As used 
herein, "BIA" is a technology for studying biospecific interactions in real time, without 
labeling any of the interactants (e.g., BIAcore™). Changes in the optical phenomenon 
surface plasmon resonance (SPR) can be used as an indication of real-time reactions 
10 between biological molecules. 

In an alternative embodiment, determining the ability Of the test compound to 
modulate the activity of FDRG can be accomplished by determining the ability of the 
FDRG protein to further modulate a FDRG target, molecule: For example, the activity of 
the target molecule on a receptor can be determined as previously described. 
15 In yet another embodiment, the cell-free assay involves contacting the FDRG 

protein or biologically active portion thereof with a known compound which binds 
FDRG to form an assay mixture, contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact with a FDRG protein, wherein 
determining the ability of the test compound to interact with a FDRG protein comprises 
20 determining the ability of the FDRG protein to preferentially bind to or modulate the 
activity of a FDRG target molecule. 

The cell-free assays of the present invention are amenable to use of both soluble 
and/or membrane-bound forms of isolated proteins (e.g. FDRG proteins or biologically 
active portions thereof or FDRG target molecules). In the case of cell-free assays in 
25 which a membrane-bound form an isolated protein is used (e.g. , a FDRG target molecule 
or receptor) it may be desirable to utilize a solubilizing agent such that the membrane- 
bound form of the isolated protein is maintained in solution. Examples of such 
solubilizing agents include non-ionic detergents such as n-octylglucoside, n- 
dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N- 
30 memylglucarnide,Triton®X-100,Triton®X-114,Thesit®, Isotridecypoly(ethylene 
glycol ether) n , 3-[(3-cholamidopropyl)dimethylamminio]-l -propane sulfonate 
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(CHAPS), 3-[(3-cholamidopropyl)dimethylamminio]-2-hydroxy- 1 -propane sulfonate 
(CHAPSO), or N-dodecyl=N,N-dimethyl-3-ammonio-l -propane sulfonate. 

In more than one embodiment of the above assay methods of the present 
invention, it may be desirable to immobilize either FDRG or its target molecule to 
5 facilitate separation of complexed from uncomplexed forms of one or both of the 
proteins, as well as to accommodate automation of the assay. Binding of a test 
compound to FDRG, or interaction of FDRG with a target molecule in the presence and 
absence of a candidate compound, can be accomplished in any vessel suitable for 
containing the reactants. Examples of such vessels include microtitre. plates, test tubes, 

10 and micro-centrifuge tubes. In one embodiment, a fusion protein can be provided which 
adds a domain that allows one or both of the proteins to be bound to a matrix. For 
example, glutathione-S-transferase/ FDRG fusion proteins or glutathione-S-; , 
transferase/target fusion proteins can be adsorbed onto glutathione sepharose beads 
(Sigma Chemical, St. Louis, MO) or glutathione derivatized microtitre plates, which are 

1 5 then combined with the test compound or the test compound and either the non-adsorbed 
target protein or FDRG protein, and the mixture incubated under conditions conducive 
to complex formation (e.g., at physiological conditions for salt and pH). Following 
incubation, the beads or microtitre plate wells are washed to remove any unbound 
components, the matrix immobilized in the case of beads, complex determined either 

20 directly or indirectly, for example, as described above. Alternatively, the complexes can 
be dissociated from the matrix, and the level of FDRG binding or activity determined 
using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either FDRG or its target molecule can 

25 be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated FDRG or 
target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques well known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, 
IL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce 
Chemical). Alternatively, antibodies reactive with FDRG or target molecules but which 

30 do not interfere with binding of the FDRG protein to its target molecule can be 

derivatized to the wells of the plate, and unbound target or FDRG trapped in the wells 
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by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of 
complexes using antibodies reactive with the FDRG or target molecule, as well as 
enzyme-linked assays which rely on detecting an enzymatic activity associated with the 
FDRG or target molecule. 

In another embodiment, modulators of FDRG expression are identified in a 
method wherein a cell is contacted with a candidate compound and the expression of 
FDRG mRNA or protein in the cell is determined. The level of expression of FDRG 
mRNA or protein in the presence of the candidate compound is compared to the level of 
expression of FDRG mRNA or protein in the absence of the candidate compound. The 
candidate compound can then be identified as a modulator of FDRG expression based on 
this comparison. For example, when expression of FDRG mRNA or protein is greater 
(statistically significantly greater) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as a stimulator of FDRG mRNA or 
protein expression. Alternatively, when expression of FDRG mRNA or protein is less 
(statistically significantly less) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as an inhibitor of FDRG mRNA or 
protein expression. The level of FDRG mRNA or protein expression in the cells can be 
determined by methods described herein for detecting FDRG mRNA or protein. 

In yet another aspect of the invention, the FDRG proteins can be used as "bait 
proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Patent No. 
5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol. Chem. 
268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; Iwabuchi et al. 
(1993) Oncogene 8:1693-1696; and Brent WO94/10300), to identify other proteins, 
which bind to or interact with FDRG ("FDRG-binding proteins" or "FDRG-bp") and 
modulate FDRG activity. Such FDRG-binding proteins are also likely to be involved in 
the propagation of signals by the FDRG proteins as, for example, downstream elements 
of a FDRG-mediated signaling pathway. Alternatively, such FDRG-binding proteins 
are likely to be cell-surface molecules associated with non-FDRG expressing cells, 
wherein such FDRG-binding proteins are involved in secondary cytokine production. . 
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The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that codes for 
FDRG is fused to a gene encoding the DNA binding domain of a known transcription 
5 factor (e.g. , GAL-4). In the other construct, a DNA sequence, from a library of DNA 
sequences, that encodes an unidentified protein ("prey" or "sample") is fused to a gene 
that codes for the activation domain of the known transcription factor. If the "bait" and 
the "prey" proteins are able to interact, in vivo, forming a FDRG-dependent complex, - 
the DNA-binding and activation domains of the transcription factor are brought into 

1 0 close proximity. This proximity allows transcription of a reporter gene (e.g. , LacZ) 
which is operably linked to a transcriptional regulatory site responsive to the 
transcription factor. Expression of the reporter gene can be detected and cell colonies 
containing the functional transcription factor can be isolated and used to obtain the 
cloned gene which encodes the protein which interacts with FDRG. 

1 5 This invention further pertains to novel agents identified by the above-described 

screening assays. Accordingly, it is within the scope of this invention'to further use an 
agent identified as described herein in an appropriate animal model. For example, an 

agent identified as described herein (e.g., a FDRG modulating 'agent, an antisense FDRG 

f . ■ 

nucleic acid molecule, a FDRG-specific antibody, or a FDRG binding partner) can be 
20 . used in an animal model to determine the efficacy, toxicity, or side effects of treatment 
with such an agent Alternatively, an agent identified as described herein can be used in 
an animal model to determine the mechanism of action of such an agent. Furthermore, 
this invention pertains to uses of novel agents identified by the above-described 
screening assays for treatments as described herein. 

25 

B. Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the 
corresponding complete gene sequences) can be used in numerous ways as 
polynucleotide reagents. For example, these sequences can be used to: (i) map their 
30 respective genes on a chromosome; and, thus, locate gene regions associated with 
genetic disease; (ii) identify an individual from a minute biological sample (tissue 
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typing); and (iii) aid in forensic identification of a biological sample. These applications 
are described in the subsections below. 

< 

1. Chromosome Mapping 

5 Once the sequence (or a portion of the sequence) of a gene has been isolated, this 

sequence can be used to map the location of , the gene on a chromosome. This process is 
called chromosome mapping. Accordingly, portions or fragments of the FDRG 
sequences, described herein, can be used to map the location of the FDRG genes, 
respectively, on a chromosome. The mapping of the FDRG sequences to chromosomes 

10 is an important first step in correlating these sequences with genes associated with 
disease. * r 

Briefly, FDRG genes can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp in length) from the FDRG sequences. Computer analysis of the 
FDRG, sequences can be used to rapidly select primers that do not span more than one 

1 5 exon in the genomic DN A, thus complicating the amplification process. These primers 
can then be used for PCR screening of somatic cell hybrids containing individual human 
chromosomes. Only those hybrids containing the human gene corresponding to the 
FDRG sequences will yield an amplified fragment. 

Somatic cell hybrids are prepared by fusing somatic cells from different 

20 mammals {e.g., human and mouse cells). As hybrids of human and mouse cells grow 
and divide, they gradually lose human chromosomes in random order, but retain the 
mouse chromosomes. By using media in which mouse cells cannot grow, because they 
lack a particular enzyme, but human cells can, the one human chromosome that contains 
the gene encoding the needed enzyme, will be retained. By using various media, panels 

25 of hybrid cell lines can be established. Each cell line in a panel contains either a single 
human chromosome or a small number of human chromosomes, and a full set of mouse 
chromosomes, allowing easy mapping of individual genes to specific human 
chromosomes. (D'Eustachio P. et al (1983) Science 220:91 9-924). Somatic cell 
hybrids containing only fragments of human chromosomes can also be produced by 

30 using human chromosomes with translocations and deletions. 
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PGR mapping of somatic cell hybrids is a rapid procedure for assigning a • 1 
particular sequence to a particular chromosome. Three or more sequences can be 
assigned per day using a single thermal cycler. Using the FDRG sequences to design 
oligonucleotide primers, sublocalization can be achieved with panels of fragments from 
5 specific chromosomes. Other mapping strategies which can similarly be used to map a 
9o, lp, or 1 v sequence to its chromosome include in situ hybridization (described in Fan, 
Y. et al (1990) PNAS, 87:6223-27), pre-screening with labeled flow-sorted 
chromosomes, and pre-selection by hybridization to chromosome specific cDNA 
libraries. 

1 0 Fluorescence in situ hybridization (FISH) of a DN A sequence to a metaphase 

chromosomal spread can further be used to provide a precise chromosomal location in .. 
one step. Chromosome spreads can be made using cells whose division has been 
blocked in metaphase by a chemical like colcemid that disrupts the mitotic spindle, the 
chromosomes can be treated briefly with trypsin, and then stained with Giemsa. A 

1 5 pattern of light and dark bands develops on each chromosome, so that the chromosomes 
can be identified individually. The FISH technique can be used with a DNA sequence 
as short as 500 or 600 bases. However, clones larger than 1 ,000 bases have a higher 
likelihood of binding to a unique chromosomal location with sufficient signal intensity 
for simple detection. Preferably 1 ,000 bases, and more preferably 2,000 bases will 

20 suffice to get good results at a reasonable amount of time. For a review of this 
technique, see Verma et al> Human Chromosomes: A Manual of Basic Techniques 
(Pergamon Press, New York 1988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for 

25 marking multiple sites and/or multiple chromosomes. Reagents corresponding to 
noncoding regions of the genes actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved within gene families, thus increasing the . 
chance of cross hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the 

30 physical position of the sequence on the chromosome can be correlated with genetic map 
data. (Such data are found, for example, in V. McKusick, Mendelian Inheritance in 
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Afowi available on-line through Johns Hopkins University Welch Medical Library). The 
"". relationship between genes and disease, mapped to the same chromosomal region, can 
' ■ then be identified through linkage analysis (co-inheritance of physically adjacent genes), 

.described in, for example, Egeland, J> et al. (1987) Nature, 325:783-787. 
5 Moreover. differences in the DNA sequences between individuals affected and 

- unaffected with a disease associated with the FDRG gene, can be determined. If a 
mutation is observed in some or all of the affected individuals but not in any unaffected 
individuals, then the mutation is likely to be the causative agent of the particular disease. 
Comparison of affected and unaffected individuals generally involves first looking for 
1 0 structural alterations in the chromosomes, such as deletions or translocations that are 
visible from chromosome spreads or detectable using PCR based on that DNA sequence. 
Ultimately, complete sequencing of genes from several individuals can be performed to 
confirm the presence of a mutation and to distinguish mutations from polymorphisms. 

15 2. Tissue Typing 

The FDRG sequences of the jpresent invention can also be used to identify 
individuals from minute biological samples. The United States military, for example, is 
considering the use of restriction fragment length polymorphism (RFLP) for 
identification of its personnel. In this technique, an individual's genomic DNA is 
20 digested with one or more restriction enzymes, and probed on a Southern blot to yield 
unique bands for identification. This method'does not suffer from the current limitations 
of "Dog Tags" which can be lost, switched, or stolen, making positive identification 
difficult. The sequences of the present invention are useful as additional DNA markers 
for RFLP (described in U.S. Patent 5,272,057)/ 
25 Furthermore, the sequences of the present invention can be used to provide an 

alternative technique which determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the FDRG sequences described 
herein can be used to prepare two PGR primers from the 5' and 3- ends of the sequences. 
These primers can then be used to amplify an individual's DNA and subsequently 
30 sequence it. , ' * . 
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Panels of corresponding DNA sequences from individuals, prepared in this 
manner, can provide unique individual identifications, as each individual will have a 
unique set of such DNA sequences due to allelic differences. The sequences of the 
present invention can be used to obtain such identification sequences from individuals 
5 and from tissue. The FDRG sequences of the invention uniquely represent portions of 
the human genome. Allelic variation occurs to some degree in the coding regions of 
these sequences, and to a greater degree in the noncoding regions. It is estimated that 
allelic variation between individual humans occurs with a frequency of about once per 
each 500 bases. Each of the sequences described herein can, to some degree, be used as. 

1 0 a standard against which DNA from an individual can be compared for identification * 
purposes. Because greater numbers of polymorphisms occur in the noncoding regions, 
fewer sequences are necessary to differentiate individuals. The noncoding sequences of 
SEQ ID NO:l can comfortably provide positive individual identification with a panel of 
perhaps 10 to 1,000 primers which each yield a noncoding amplified sequence of 100 

15 bases. If predicted coding sequences, such as those in SEQ ID NO:3 are used, a more 
appropriate number of primers for positive individual identification would be 500-2,000. 

If a panel of reagents from FDRG sequences described herein is used to generate 
a unique identification database for an individual, those same reagents can later be used 
to identify tissue from that individual. Using the unique identification database, positive 

20 identification of the individual/living or dead, can-be made from extremely small tissue 
samples. . 

3. Use of Partial FDRG Sequences in Forensic Biology 

DNA-based identification techniques can also be used in forensic biology. 
25 Forensic biology is a scientific field employing genetic typing of biological evidence 
found at a crime scene as a means for positively identifying, for example, a perpetrator 
of a crime. To make such an identification, PCR technology can be used to amplify 
. DNA sequences taken from very small biological samples such as tissues, e.g. , hair or • 
skin, or body fluids, e.g., blood, saliva, or semen found at a crime scene. The amplified 
30 sequence can then be compared to a standard, thereby allowing identification of the 
origin of the biological sample. 
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The sequences of the present invention can be used to provide polynucleotide 
reagents, e.g., PCR primers, targeted to specific loci-in the human genome, which can 
enhance the reliability of DNA-based forensic identifications by, for example, providing 
another "identification marker" (Le. another DNA sequence that is unique to a particular 

5 individual). As mentioned above,- actual base sequence information can be used for 
identification as an accurate alternative to patterns formed by restriction enzyme 
generated fragments. Sequences targeted to noncoding regions of SEQ ID NO:l are 
particularly appropriate for this use as greater numbers of polymorphisms occur in the 
noncoding regions, making it easier to differentiate individuals using this technique.- 

10 Examples of polynucleotide reagents include the FDRG sequences or portions thereof, 
e.g., fragments derived from the noncoding regions of SEQ ID NO;l having a length of 
at least 20 bases, preferably at least 30 bases. 

The FDRG sequences described herein can further be used to provide 
polynucleotide reagents, e.g., labeled or labelable probes which can be used in, for 

15 example, an in situ hybridization technique, to identify a specific tissue, e.g., brain 
tissue. This can be very useful in cases where a forensic pathologist is presented with a 
tissue of unknown origin. Panels of such FDRG probes can be used to identify tissue by 

species and/or by organ type. 

In a similar fashion, these reagents, e.g., FDRG primers or probes can be used to 
20 screen tissue culture for contamination (i.e. screen for the presence of a mixture of 
different types of cells in a culture). 

C. Predictive Medicine ; 

The present invention also pertains to the field of predictive medicine in which 
25 diagnostic assays, prognostic assays, and monitoring clinical trials are used for 
prognostic (predictive) purposes to thereby treat an individual prophylactically. 
Accordingly, one aspect of the present invention relates to diagnostic assays for 
determining FDRG protein and/or nucleic acid expression as well as FDRG activity, in 
■ the context of a biological sample (e.g. , blood, serum, cells, tissue) to thereby determine 
30 whether an individual is afflicted with a disease or disorder, or is at risk of developing a 
disorder, associated with aberrant FDRG expression or activity. The invention also 
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provides for prognostic (or predictive) assays for determining whether an individual is at 
risk of developing a disorder associated with FDRG protein, nucleic acid expression or 
activity. For example, mutations in a FDRG gene can be assayed in a biological 
sample. Such assays can be used for prognostic or predictive purpose to thereby 
5 phophylactically treat an individual prior to the onset of a disorder characterized by or 
associated with FDRG protein, nucleic acid expression or activity. 

Another aspect of the invention pertains to monitoring the influence of agents 
(e.g. , drugs, compounds) on the expression or activity of FDRG in clinical trials. 
10 These and other agents are described in further detail in the following sections. 

1. Diagnostic Assays 

An exemplary method for detecting the presence or absence of FDRG in a 
biological sample involves obtaining a biological sample from a test subject and 

15 contacting the biological sample with a compound or an agent capable of detecting . 
FDRG protein or nucleic acid (e.g., mRNA, genomic DNA) that encodes FDRG protein 
such that the presence of FDRG is detected in the biological sample. A preferred agent 
for detecting FDRG mRNA or genomic DNA is a labeled nucleic acid probe capable of 
hybridizing to FDRG mRNA or genomic DNA. The nucleic acid probe can be, for 

20 example, a full-length FDRG nucleic acid, such as the nucleic acid of SEQ ID NO: 1 , 
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:l 1, SEQ ID NO: 13, SEQ ID NO: 14, the 
DNA insert of the plasmid deposited with ATCC as Accession Number 98632, or a 
portion thereof, such as an oligonucleotide of at least 15,30, 50, 1 00, 250 or 500 
nucleotides in length and sufficient to specifically hybridize under stringent conditions 

25 to FDRG mRNA or genomic DNA. Other suitable probes for use in the diagnostic 
assays of the invention are described herein. 

A preferred agent for detecting FDRG protein is an antibody capable of binding 
to FDRG protein, preferably an antibody with a detectable label. Antibodies can be 
polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment thereof 

30 (e.g. , Fab or F(ab')2) can be used. The term "labeled", with regard to the probe or 

antibody, is intended to encompass direct labeling of the probe or antibody by coupling 
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physically linking) a detectable substance to the probe or antibody, as well as 
indirect labeling of the probe or antibody by reactivity with another reagent that is 
directly labeled. Examples of indirect labeling include detectionof a primary antibody 
using a fluorescently labeled secondary antibody and end-labeling of a DNA probe with 
5 biotin such that it can be detected with fluorescently labeled streptavidin. Theterm 
"biological sample" is intended to include tissues, ceUs and biological fluids isolated 
from a subject, as well as tissues, cells and fluids present within a subject. That is, the 
detection method of the invention can be used to detect FDRG mRNA, protein, or 
genomic DNA in a biological sample i« vitro as well as in vivo. For example, in vitro 
10 techniques for detection of FDRG mRNA include Northern hybridizations and in situ 
. hybridizations. In vitro techniques for detection of FDRG protein include enzyme 
linked immunosorbent assays (ELISAs), Western blots, immunoprecipitations and 
immunofluorescence. In vitro techniques for detection of FDRG. genomic DNA include 
Southern hybridizations. Furthermore, in vivo techniques for detection of FDRG protein 
15 include introducing into ; a subject a labeled anti-FDRG antibody. For example, the 
antibody can be labeled with a radioactive marker whose presence and location in a 
subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the 
test subject. Alternatively, the biological sample can contain mRNA molecules from the 
20 test subject or genomic DNA molecules from the test subject. A preferred biological 
sample is a serum sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control 
biological sample from a control subject, contacting the control sample with a 
compound or agent capable of detecting FDRG protein, mRNA, or genomic DNA, such 
25 that the presence of FDRG protein, mRNA or genomic DNA is detected in the 

biological sample, and comparing the presence of FDRG protein, mRNA or genomic 
DNA in the control sample with the presence of FDRG protein, mRNA or genomic 

DNA in the test sample. 

The invention also encompasses kits for detecting the presence of FDRG in a 
30 biological sample. For example, the kit can comprise a labeled compound or agent 
capable of detecting FDRG protein or mRNA in a biological sample; means for 
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determining the amount of FDRG in the sample; and means for comparing the amount 
of FDRG in the sample with a standard. The compound or agent can be packaged in a 
suitable container. The kit can further comprise instructions for using the kit to detect 
FDRG protein or nucleic acid. 

5 

2. Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant 
FDRG expression or activity. For example, the assays described herein, such as the 

10 preceding diagnostic assays or the following assays, can be utilized to identify a subject 
having or at risk of developing a disorder associated with FDRG protein, nucleic acid 
expression or activity such as a proliferative disorder, a differentiative or developmental 
disorder, a cell migration disorder or a hematopoietic disorder. Thus, the present 
invention provides a method for identifying a disease or disorder associated with 

15 : aberrant FDRG expression or activity in which a test sample is obtained from a subject 
and FDRG protein or nucleic acid (e.g, mRNA, genomic DNA) is detected, wherein the 
presence of FDRG protein or nucleic acid is diagnostic for a subject having or at risk of 
developing a disease or disorder associated with aberrant FDRG expression or activity. 
As used herein, a "test sample" refers to a biological sample obtained from a subject of 

20 interest. For example, a test sample can be a biological fluid (e.g. , serum), cell sample,, 
ortissue. 

Furthermore, the prognostic assays described herein can be used to determine 
whether a subject can be administered an agent (e.g. , an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) 

25 to treat a disease or disorder associated with aberrant FDRG expression or activity. For 
example, such methods can be used to determine whether a subject can be effectively 
treated with an agent for a disorder, such as a proliferative disorder, a differentiative or 
developmental disorder, a cell migration or recruitment disorder, or a hematopoietic 
disorder. Thus, the present invention provides methods for determining whether a 

30 subject can be effectively treated with an agent for a disorder associated with aberrant 
FDRG expression or activity in which a test sample is obtained and FDRG protein or 
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nucleic acid is detected (e.g., wherein the presence of FDRG protein or nucleic acid is 
diagnostic for a subject that can be administered the agent to treat a disorder associated 
with aberrant FDRG expression ^or activity.) 

The methods of the invention can also be used to detect genetic alterations in a 
5 FDRG gene, thereby determining if a subject with the altered gene is at risk for a 
disorder characterized by aberrant differentiation, aberrant angiogenesis, an aberrant 
proliferative response or anaberrant hematopoietic response. In preferred embodiments, 
the methods include detecting, in a sample of cells from the subject, the presence or 
absence of a genetic alteration characterized by at least one of an alteration affecting the 

1 0 integrity of a gene encoding a FDRG protein, or the mis-expression of the FDRG gene. 
For example, such genetic alterations can be detected by ascertaining the existence of at 
least one of 1) a deletion of one or more nucleotides from a FDRG gene; 2) an addition 
of one or more nucleotides to a FDRG gene; 3) a substitution of one or more 
nucleotides of a FDRG gene, 4) a chromosomal rearrangement of a FDRG gene; 5) an 

15 alteration in the level of a messenger RNA transcript of a FDRG gene, 6) aberrant 

modification of a FDRG gene, such as of the methylation pattern of the genomic DNA, 
7) the presence of a non-wild type splicing pattern of a messenger RNA transcript of a 
FDRG gene, 8) a non-wild type level of a FDRG protein, 9) allelic loss of a FDRG 
gene, and 10) inappropriate post-translational modification of a FDRG protein. As r 

20 described herein, there are a large number of assay techniques known in the art which 
can be used for detecting alterations in a FDRG gene. A preferred biological sample is 
serum sample isolated by conventional means from a subject. 

In certain embodiments, detection of the alteration involves the use of a 
probe/primer in a polymerase chain reaction (PCR) (see, e.g. , U.S. Patent Nos. 

25 4,683,195 and 4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a 
ligation chain reaction (LCR) (see, e.g., Landegran et al (1988) Science 241:1077-1080; 
and Nakazawa et al (1994) PNAS 91 :360-364), the latter of which can be particularly 
useful for detecting point mutations in the FDRG gene (see Abravaya et al (1995) 
Nucleic Acids Res .23 :675-682). This method can include the steps of collecting a 

30 sample of cells from a patient, isolating nucleic acid {e.g., genomic, mRNA or both) 
from the cells of the sample, contacting the nucleic acid sample with one or more 
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primers which specifically hybridize to a FDRG gene under conditions such that 
hybridization and amplification of the FDRG gene (if present) occurs, and detecting the 
presence or absence of an amplification product, or detecting the size of the 
amplification product and comparing the length to a control sample. It is anticipated 
5 that PCR and/or LCR may be desirable to use as a preliminary amplification step in 
conjunction with any of the techniques used for detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication 
(Guatelli, J.C. et ai, 1990, Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh, D.Y. et ai, 1989, Proc. Natl. Acad. Sci. USA 86:1173- 

10 1 177), Q-Beta Replicase (Lizardi, P.M. et all, 1988, Bio/Technology 6:1 197), or any 
other nucleic acid amplification method, followed by the detection of the amplified 
molecules using techniques well known to those of skill in the art. These detection 
schemes are especially useful for the detection of nucleic acid molecules if such 
molecules are present in very low numbers. 1 

15 In an alternative embodiment, mutations in a FDRG gene from a sample cell can 

be identified by alterations in restriction enzyme cleavage patterns. For example, 
sample and control DNA is isolated, amplified (optionally), digested with one or more 
restriction endonucleases, and fragment length sizes are determined by gel 
electrophoresis and compared. Differences in fragment length sizes between sample and 

20 control DNA indicates mutations in the sample DNA. Moreover, the use of sequence 
specific ribozynies (see, for example, U.S. Patent No. 5,498,53 1 ) can be used to score 
for the presence of specific mutations by development or loss of a ribozyme cleavage 
site. 

- In other embodiments, genetic mutations in FDRG can be identified by 
25 hybridizing a sample and control nucleic acids, e.g., DNA or RN A, to high density 
arrays containing hundreds or thousands of oligonucleotides probes (Cronin, M.T. et ai 
(1996) Human Mutation 7: 244-255; Kozal, MJ. et al (1996) Nature Medicine 2: 753- 
759). For example, genetic mutations in FDRG can be identified in two dimensional 
arrays containing light-generated DNA probes as described in Cronin, M.T. et al. supra. 
30 Briefly, a first hybridization array of probes can be used to scan, through long stretches 
of DNA in a sample and control to identify base changes between the sequences by 
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making linear arrays of sequential ovelapping probes. This step allows the identification 
of point mutations. This step is followed by a second hybridization array that allows the 
characterization of specific mutations by using smaller, specialized probe arrays 
complementary to all variants or mutations detected. Each mutation array is composed 
of parallel probe sets, one complementary to the wild-type gene and the other 
complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in 
the art can be used to directly sequence the FDRG gene and detect mutations by 
comparing the sequence of the sample FDRG with the corresponding wild-type (control) 
sequence. Examples of sequencing reactions include those based on techniques 
developed by Maxim and Gilbert ((1977) PNAS 74:560) or Sanger (Naeve et al. (1977) 
PNAS 74:5463). It is also contemplated that any of a variety of automated sequencing 
procedures can be utilized when performing the diagnostic assays ((1995) Biotechniques 
19:448), including sequencing by mass spectrometry (see, e.g., ?CT International 
Publication No. WO 94/16101; Cohen et al. (1996) Adv. Chromatogr. 36:127-162; and 
Griffin et al. (\993) Appl. Biochem. Biotechnol. 38:147-159). 

Other methods for detecting mutations in the FDRG gene include methods in 
which protection from cleavage agents is used to detect mismatched bases in RNA/RNA 
or RNA/DNA heteroduplexes (Myers etal. (1985) Science 230:1242). In general, the 
art technique of "mismatch cleavage" starts by providing heteroduplexes of formed by 
hybridizing (labeled) RNA or DNA containing the wild-type FDRG sequence with 
potentially mutant RNA or DNA obtained from a tissue sample, the double-stranded 
duplexes are treated with an agent which cleaves single-stranded regions of die duplex 
such as which will exist due to basepair mismatches between the control and sample 
strands. For instance, RNA/DNA duplexes can be treated with RNase and DNA/DNA 
hybrids treated with SI nuclease to enzymatically digesting the mismatched regions. In 
other embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with 
hydroxylamine or osmium tetroxide and with piperidine in order to digest mismatched 
regions. After digestion of the mismatched regions, the resulting material is then 
separated by size on denaturing polyacrylamide gels to determine the site of mutation. 
See, for example, Cotton et al (1988) Proc. Natl Acad Sci USA 85:4397; Saleeba et al 
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(1992) Methods Enzymol 217:286-295. In a preferred embodiment, the control DNA or 
RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or 
more proteins that recognize mismatched base pairs in double-stranded DNA (so called 
5 "DNA mismatch repair" enzymes) in defined systems for detecting and mapping point 
mutations in FDRG cDNAs obtained from samples of cells. For example, the mutY 
enzyme of £ coli cleaves A at G/A mismatches and the thymidine DNA glycosylate , 
from HeLa cells cleaves T at G/T mismatches (Hsu et al (1994) Carcinogenesis 
15:1 657-1 662). According to an exemplary embodiment, a probe based on a FDRG 

1 0 sequence, e.g. , a wild-type FDRG sequence, is hybridized to a cDNA or other DNA 
product from a test cell(s). The duplex is treated with a DNA mismatch repair enzyme, 
and the cleavage products, if any, can be detected from electrophoresis protocols or the 
like. See, for example, U.S. Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic mobility will be used to 

1 5 identify mutations in FDRG genes. For example, single strand conformation 

polymorphism (SSCP) may be used to detect differences in electrophoretic mobility 
between mutant and wild type nucleic acids (Orita et al (1989) Proc Natl. Acad. Sci 
USA: 86:2766, see also Cotton (1993) Mutat Res 285:125-144'; and Hayashi (1992) 
Genet Anal Tech Appl 9:73-79). Single-stranded DNA fragments of sample and control 

20 FDRG nucleic acids will be denatured and allowed to renature. The secondary structure 
of single-stranded nucleic acids varies according to sequence, the resulting alteration in 
electrophoretic mobility enables the detection of even a single base change. The DNA 
.fragments may be labeled or detected with labeled probes. The sensitivity of the assay 
may be enhanced by using RNA (rather than DNA), in which the secondary structure is 

25 more sensitive to a change in sequence. In a preferred embodiment, the subject method 
utilizes heteroduplex analysis to separate double stranded heteroduplex molecules on the 
basis of changes in electrophoretic mobility (Keen et al (1991) Trends Genet 7:5). ,< 
In yet another embodiment the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 

30 gradient gel electrophoresis (DGGE) (Myers etal (1985) Nature 3 13:495). When" 

DGGE is used as the method of analysis, DNA will be modified to insure that it does not 
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completely denature, for example by adding a GC clamp of approximately 40 bp of 
high-melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient is 
used in place of a denaturing gradient to identify differences in the mobility of control 
and sample DNA (Rosenbaum and Reissner (1987) Biophys Chem 265:12753). 
: 5 Examples of other techniques for detecting point mutations include, but are not 

limited to, selective oligonucleotide hybridization, selective amplification, or selective 
primer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
which permit hy bridization only if a perfect match is found (Saiki et al (1986) Nature 
10 324:163);, Saiki et al (1989) Proc. Natl Acad Sci USA 86:6230). Such allele specific 
oligonucleotides are hybridized to PCR amplified target DNA or a number of different . 
mutations when the oligonucleotides are attached to.the hybridizing membrane and 
hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology which depends on 
15 selective PCR amplification may be used in conjunction with the instant invention. , 
Oligonucleotides used as primers for specific amplification may carry the mutation of 
interest in the center of the molecule (so that amplification depends on differential 
hybridization) (Gibbs et al (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme 
3' end of one primer where, under appropriate conditions, mismatch can prevent, or 
20 reduce polymerase extension (Prossner (1993) Tibtech l 1 :238). In addition it may be 
desirable to introduce a novel restriction site in the region of the mutation to create 
cleavage-based detection (Gasparini et al (1 992) Mol Cell Probes 6:1). It is anticipated 
that in certain embodiments amplification may also be performed using Tacj ligase for 
amplification (Barany (1991) Proc. Natl Acad Sci USA 88:189). Iasuch cases, ligation 
25 will occur ohly if there is a perfect match at the 3* end of the 5' sequence making it 
possible to detect the presence of a known mutation at a specific site by looking for the 
, presence or absence of amplification. 

The methods described herein may be performed, for example, by utilizing pre- 
; packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
30 described herein, which may be conveniently used, e.g., in clinical settings to diagnose 
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patients exhibiting symptoms or family history of a disease or illness involving a FDRG 
gene. 

Furthermore, any cell type or tissue in which FDRG is expressed may be utilized » 
in the prognostic assays described herein. 
5 , 

3. Monitoring of Effects During Clinical Trials 
Monitoring the influence of agents (e.g., drugs, compounds) on the expression or 
activity of FDRG (e.g., activation of a FDRG-dependent signal transduction pathway; , 
modulation of angiogenesis or cellular proliferation; modulation of cell migration or 

1 0 recruitment; or modulation of a hematopoietic response) can be applied not only in basic 
drug screening, but also in clinical trials. For example, the effectiveness of an agent 
determined by a screening assay as described herein to increase FDRG gene expression, 
protein levels, or upregulate FDRG activity, can be monitored in clinical trials of • 
subjects exhibiting decreased FDRG gene expression, protein levels, or dowhregulated 

1 5 FDRG activity. Alternatively, the effectiveness of an agent determined by a screening 
assay to decrease FDRG gene expression, protein levels, or downregulate FDRG 
activity^ can be monitored in clinical trials of subjects exhibiting increased FDRG gene 
expression, protein levels, or upregulated FDRG activity. In such clinical trials, the 
expression or activity of FDRG and, preferably, other genes that have been implicated 

20 in, for example, a proliferative disorder can be used as a "read out' 1 or markers of the 
proliferation of a particular cell. 

For example, and not by way of limitation, genes, including FDRG, that are 
modulated in cells by treatment with an agent (e.g. , compound, drug or small molecule) 
which modulates FDRG activity (e.g. , identified in a screening assay as described 

25 herein) can be identified. Thus, to study the effect of agents on proliferative disorders, 
developmental disorder, or hematopoietic disorder, for example, in a clinical trial, cells 
can be isolated and RNA prepared and analyzed for the levels of expression of FDRG 
and other genes implicated in the proliferative disorder, developmental disorder, or 
hematopoietic disorder, respectively. The levels of gene expression (i.e., a gene 

30 expression pattern) can be quantified by Northern blot analysis or RT-PCR, as described 
herein, or alternatively by measuring the amount of protein produced, by one of the 
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methods as described herein, or by measuring the levels of activity ofFDRG or other 
genes. In this way, the gene expression pattern can serve as a marker, indicative of the 
..physiological response of the cells to the agent. Accordingly, this response state may be 
determined before, and at various points during,.treatment of the individual with the 

5 agent. • x- 

In a preferred embodiment, the present invention provides a method for 

monitoringthe effectiveness of treatment of a subject with an agent (e.g., an agonist, - 
antagonist,.peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug 
candidate identified by the screening assays described herein) comprising the steps of (i) 
1 0 - obtaining a pre-administration sample from a subject prior to administration of the 
agent; (ii) detecting the level of expression of a FDRG protein, mRNA, or genomic 
DNA in the preadministration sample; (iii) obtaining one or more post-administration 
samples from the subject; (iv) detecting the level of expression or activity of the FDRG 
protein, mRNA, or genomic DNA in the post-administration samples; (v) comparing the 
15 level of expression or activity of the FDRG protein, mRNA* or genomic DNA in the 
. pre-administration sample with the FDRG protein, mRNA, or genomic DNA in the post 
administration sample or samples ; and (vi) altering the administration of the agent to the 
' subject accordingly. For example, increased administration of the agent may be 
desirable to increase the expression or activity of FDRG to higher levels than detected, 
20 i.e., to increase the effectiveness of the agent. Alternatively, decreased administration of 
the agent may be desirable to decrease expression or activity ofFDRG to lower levels 
.than detected, i.e. to decrease the effectiveness of the agent. According to such an 
embodiment, FDRG expression or activity may be used as an indicator of the 
effectiveness of an agent, even in the absence of an observable phenotypic response. 



25 

C. Methods of Treatment ; 

The present invention provides for both prophylactic and therapeutic methods of 
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated 
* with aberrant FDRG expression or activity. With regards to both prophylactic and 

30 'therapeutic methods of 

modified, based on knowledge obtained from the field of pharmacogenomics. k 
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"Pharmacogenomics", as used herein, refers to the application of genomics technologies 
; r > such as gene sequencing, statistical genetics, and gene expression analysis to drugs in 
clinical development and on the market. More specifically, the term refers the study of 

4*.-. *■ . 

e v how a patient's genes determine his or her response to a drug {e.g., a patient's "drug 
• 5 response phenotype", or "drug response genotype".) Thus, another aspect of the 
. . invention provides methods for tailoring an individual's prophylactic or therapeutic 
treatment with either the FDRG molecules of the present invention or FDRG modulators 
' according to that individual's drug response genotype. Pharmacogenomics allows a 
clinician or physician to target prophylactic or therapeutic treatments to patients who 
10 will most benefit from the treatment and to avoid treatment of patients who will 
experience toxic drug-related side effects. 
f% 

' 1. Prophylactic Methods 

In one aspect, the invention provides a method for preventing in a subject, a 

15 disease or condition associated with an aberrant FDRG expression or activity, by 

administering to the subject an agent which modulates FDRG expression or at least one 

; FDRG activity. Subjects at risk for a disease which is caused or contributed to by 
aberrant FDRG expression or activity can be identified by, for example, any or a 
combination of diagnostic or prognostic assays as described herein. Administration of a 

20 prophylactic agent can occur prior to the manifestation of symptoms characteristic of the 
FDRG aberrancy, such that a disease or disorder is prevented or, alternatively, delayed 
in its progression. Depending on the type of FDRG aberrancy, for example, a FDRG- 
agonist or FDRG antagonist agent can be used for treating the subject. The appropriate 
agent can be determined based on screening assays described herein. 

25 . 4 < 

2. Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating FDRG 
expression or activity for therapeutic purposes. The modulatory method of the invention 

involves contacting a cell with an agent that modulates one or more of the activities of 

♦ 

30 FDRG protein activity associated with the cell. An agent that modulates FDRG protein 
activity can be an agent as described herein, such as a nucleic acid or a protein, a 
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naturally-occurring target molecule of a FDRG protein, a peptide, a FDRG 
peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one 
or more FDRG protein activity. Examples of such stimulatory agents include active ' 
FDRG protein and a nucleic acid molecule encoding FDRG that has been introduced 
5 into the cell. In another embodiment, the agent inhibits one or more FDRG protein 
activity. Examples of such inhibitory agents include antisense FDRG nucleic acid 
molecules and anti-FDRG antibodies. These modulatory methods can be performed in 
vitro (e.g. , by culturing the cell with the agent) or, alternatively, in vivo (e.g, by 
administering the agent to a subject). As such, the present invention provides methods ? 

1 0 of treating an individual afflicted with a disease or disorder characterized by aberrant 
expression or activity of a FDRG protein or nucleic acid molecule. In one embodiment, 
the method involves administering an agent (e.g., an agent identified by a screening 
assay described herein), or combination of agents that modulates (e.g., upregulates or 
downregulates) FDRG expression or activity. In another embodiment, the method 

1 5 involves administering a FDRG protein or nucleic acid molecule as therapy to 
compensate for reduced or aberrant FDRG expression or activity. 

A preferred embodiment of the present invention involves a method for treatment 
of a disease or disorder associated with a FDRG protein which includes the step of 
administering a therapeutically effective amount of an antibody to a FDRG protein to a 

20 subject. As defined herein, a therapeutically effective amount of antibody (ie. 9 an 
effective dosage) ranges from about 0.001 to 30 mg/kg body weight, preferably about 
0.01 to 25 mg/kg body weight, more preferably about 0.1 to 20 mg/kg body weight, and 
even more preferably about 1 to 1 0 mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 4 to 7 mg/kg, or 
5 to 6 mg/kg body weight. The skilled artisan will appreciate that certain factors may 

25 influence the dosage required to effectively treat a subject, including but not limited to 
the severity of the disease or disorder, previous treatments, the general health and/or age 
of the subject, and other diseases present. Moreover, treatment of a subject with a 
therapeutically effective amount of an antibody can include a single treatment or, 
preferably, can include a series of treatments. In a preferred example, a subject is treated 

30 with antibody in the range of between about 0. 1 to 20 mg/kg body weight, one time per 
week for between about 1 to 10 weeks, preferably between 2 to 8 weeks, more 
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preferably between about 3 to 7 weeks, and even more preferably for about 4, 5, or 6 . 
weeks. It will also be appreciated that the effective dosage of antibody used for 
treatment may increase or decrease over the course of a particular treatment. Changes in 
dosage may result from the results of diagnostic assays as described herein. 
5 ^ Stimulation ofFDRG activity is desirable in situations in which FDRGis 

abnormally downregulated and/or in which increased FDRG activity is likely to have a 
beneficial effect. One example of such a situation is where a subject has a disorder 
characterized by aberrant proliferative disorder (e.g., cancer). Yet another example of 
such a situation is where a subject has a disorder characterized by an aberrant 
1 0 hematopoietic response. Yet another example of such a situation is where a subject has 
a disorder characterized by aberrant differentiation or development or aberrant cell 
migration. 

• ■ r ... 

3. Pharmacogenomics 

The FDRG molecules of the present invention, as well as agents, or modulators 

1 5 which have a stimulatory or inhibitory effect on FDRG activity (e.g. , FDRG gene 
expression) as identified by a screening assay described herein can be administered to 
. individuals to treat (prophylactically or therapeutically) disorders (e.g, proliferative 
disorders or developmental disorders) associated with aberrant FDRG activity. In 
conjunction with such treatment, pharmacogenomics (i.e., the study of the relationship 

20 between an individual's genotype and that individual's response to a foreign compound 
or drug) may be considered. Differences in metabolism of therapeutics can lead to 
severe toxicity or therapeutic failure by altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, a physician or clinician may 
consider applying knowledge obtained in relevant pharmacogenomics studies in 

25 determining whether to administer a FDRG molecule or FDRG modulator as well as 
tailoring the dosage and/or therapeutic regimen of treatment with a FDRG molecule or 
FDRG modulator. ' ; 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected 

30 persons. See e.g., Eichelbaum, M., Clin Exp Pharmacol Physiol, 1996, 23(10-1 1) :983- 
985 and Linder, M.W., Clin Chem, 1997, 43(2):254-266. In general, two types of 
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pharmacogenetic conditions can be differentiated. Genetic conditions transmitted as a 
single factor altering the way drugs act on the body (altered drug action) or genetic 
conditions transmitted as single factors altering the way the body acts on drugs (altered 
drug metabolism). These pharmacogenetic conditions can occur either as rare genetic 
5 defects or as naturally-occurring polymorphisms. For example, glucose-6-phosphate 
dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the 
main clinical complication is haemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

One pharmacogenomics approach to identifying genes that predict drug 

1 0 response, known as "a genome-wide association", relies primarily on a high-resolution 
map of the human genome consisting of already known gene-related markers (e.g., a "bi- 
allelic" gene marker map which consists of 60,000-100,000 polymorphic or variable 
sites on the human genome, each of which has two variants.) Such a high-resolution 
genetic map can be compared to a map of the genome of each of a statistically 

1 5 significant number of patients taking part in a Phase II/III drug trial to identify markers 
associated with a particular observed drug response or side effect Alternatively, such a 
high resolution map can be generated from a combination of some ten-million known 
single nucleotide polymorphisms (SNPs) in the human genome. As used herein, a 
"SNP" is a common alteration that occurs in a single nucleotide base in a stretch of 

20 DNA. For example, a SNP may occur once per every 1000 bases of DNA. A SNP may 
be involved in a disease process, however, the vast majority may not be disease- 
associated. Given a genetic map based on the occurrence of such SNPs, individuals can 
be grouped into genetic categories depending on a particular pattern of SNPs in their 
individual genome. In such a manner, treatment regimens can be tailored to groups of 

25 genetically similar individuals, taking into account traits that may be common among 
such genetically similar individuals. 

Alternatively, a method termed the "candidate gene approach", can be utilized to 
identify genes that predict drug response. According to this method, if a gene that 
encodes a drugs target is known (e.g., a FDRG protein or FDRG receptor of the present 

30 invention), all common variants of that gene can be fairly easily identified in the 
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population and it can be determined if having one version of the gene versus another is 
associated with a particular drug response. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of both the intensity and duration of drug action. The discovery of 

5 genetic polymorphisms of drug metabolizing enzymes (e.g. , N-acetyltransferase 2 (NAT 
2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug effects or show exaggerated 
drug response and serious toxicity after taking the standard and safe dose of a drug. 
These polymorphisms are expressed in two phenotypes in the population, the extensive 

10 metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among 
different populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence of 
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently 
experience exaggerated drug response and side effects when they receive standard doses. 

15 If a metabolite is the active therapeutic moiety, PM show no therapeutic response, as 
demonstrated for the analgesic effect of codeine mediated by its C YP2D6-formed 
metabolite morphine. The other extreme are the so called ultra-rapid metabolizers who 
do not respond to standard doses. Recently, the molecular basis of ultra-rapid 
metabolism has been identified to be due to CYP2D6 gene amplification. 

20 . Alternatively, a method termed the "gene expression profiling", can be utilized to 

identify genes that predict drug response. For example, the gene expression of an , 
animal dosed with a drug (e.g. , a FDRG molecule or FDRG modulator of the present 
invention) can give an indication whether gene pathways related to toxicity have been 
turned on. 

25 Information generated from more than one of the above pharmacogenomics 

approaches can be used to determine appropriate dosage and treatment regimens for 
prophylactic or therapeutic treatment an individual. This knowledge, when applied to 
dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus 
enhance therapeutic or prophylactic efficiency when treating a subject with a FDRG 

3 0 molecule or FDRG modulator, such as a modulator identified by one of the exemplary 
screening assays described herein. 
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This invention is further illustrated by the following examples which should.not 
be construed as limiting. The contents of all references, patents and published patent 
applications cited throughout this application are hereby incorporated by reference. 
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Example 1: Isolation And Characterization of Human FDRG cDNAs 

In this.example, the isolation and characterization of the gene encoding human 
5 FDRG (also referred to as , TDRG H ) is described. / 

Isolation of human FDRG cDNA 

A human FDRG cDN A was isolated from a human aortic endothelial cDNA 
library. To construct the library, three micrograms of poly A + RNA were isolated from 

10 human aortic endothelial cells, reverse transcribed, and used to synthesize, a cDNA 
library using the Superscript cDNA Synthesis kit™ (Gibco BRL; Gaithersburg, MD). 
. Complementary DNA was directionally cloned into the expression plasmid pMET7 
using the Sail and NotI sites in the polylinker to construct a plasmid library. < ■ 
Transformants were picked and amplified for single-pass sequencing. Additionally, 

1 5 human aortic endothelial cDN A was ligated into the Sall/NotI sites of the ZIPLOX™ 
vector (Gibco BRL) for: construction of a lambda phage cDNA library. 

A partial cDNA clone encoding FDRG was identified from the above-described 
cDNA library using the following method. First, each sequence was checked to 
determine if it was a bacterial, ribosomal, or mitochondrial contaminant. Such '., 
* 20 sequences were excluded from the subsequent analysis. Second, sequence artifacts, such 
as vector and repetitive" elements, were masked and/or removed from each sequence. 
Third, the remaining sequences were searched against a copy of the GenBank nucleotide 
database using the BLASTN™ program (BLASTK 1.3MP: Altschul et al 9 J. Mol Bio. 
215:403, 1990). Fourth, the sequences were analyzed against a non-redundant protein 

25 database with the BLASTX™ program, which translates a nucleic acid sequence in all 
six frames and compares it against available protein databases (BLASTX 
1 .3MP:Altschul et of:, supra). This protein database is a combination of the Swiss-Prot, 
PIR, and NCBI GenPept protein databases. 
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The original first pass sequence of the partial cDNA clone showed homology to 
angiopoietin 2 using the BL ASTX™ program. Additional screening of the phage library 
led to the isolation of a full length clone for human FDRG. The nucleotide sequence 5 
and predicted amino acid sequence are shown in Figure 1 (corresponding to SEQ ID 
5 NO: 1 and SEQ ID NO:2, respectively.) The FDRG protein (corresponding to amino , 
acids 1-406 of the predicted amino acid sequence, SEQ ID NO:2) shows 31 .0% identity 
to the human ficolin and 29.0% identity to human angiopoietin 2 (see Figure 3). Percent 
identity was calculated using the Gap program from GCG (Wisconsin Genetics 
Computer Group) which incorporates the Needleman-Wunsch alignment algorithm, 

10 Needleman and Wuhsch (1970)7. Uol Biol 48:444-453, using a gap weight of 12 anda 
length weight of 4. • T 

This human FDRG protein contains a C-terminal fibrinogen-like domain 
(corresponding to amino acids 186-399 of the predicted amino acid sequence, SEQ ID 
NO:2 and amino acids 16 1-376 of SEQ ID NO:5) and a signal sequence (corresponding 

1 5 to amino acids 1 -25 of the predicted amino acid sequence, SEQ ID NO:2) which is 

cleaved to form a mature FDRG protein (corresponding to amino acids 1-381 of SEQ ID 
NO:5). , . 

Example 2: Distribution of FDRG mRNA In Human Tissues 
20 Northern Blot Analysis 

The expression of FDRG was analyzed using Northern blot hybridization. For 
analysis of human FDRG, the entire FDRG insert was isolated from the plasmid 
following Sall/NotI digestion and used as a probe! The probe DNA was radioactively 
labeled with 32 P-dCTP using the Prime-It kit™ (Stratagene, La Jolla, CA) according to 
25. the instructions of the supplier. Filters containing human mRNA (MTNI and MTNII 
from Clontech, Palo Alto, CA; ) were probed in ExpressHyb™ hybridization solution 
(Clontech) arid washed at high stringency, according to manufacturer's 
recommendations. 
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An approximately 2.0 kb FDRG transcript was observed at variable levels in all 
tissues examined (spleen, thymus, prostate, testes, ovary, small intestine, colon, heart, ; 
brain, placenta, lung, liver, skeletal muscle, kidney and pancreas) with the sole 
exception of peripheral blood leukocytes. The highest levels of FDRG expression were 
5 observed in the placenta. 

Example 3: Screening of Mouse Tissues for FDRG Binding Sites 

This Example describes the use of cell supernatant containing AP-hFDRG to 
screen tissue sections of mouse embryos and of whole mice for Tl 1 5 binding sites. 

10 Screening of tissue section with supernatant containing alkaline phosphatase 

(AP) fusion protein was done as described by Cheng and Flanagan (1994) Cell 79:157- 
168. Briefly, fresh frozen sections (8 ^im) of mouse embryos (day 14.5 of prenatal 
development, E14.5) and of whole mice (postnatal day 1.5, PI. 5) were prepared and 
rinsed in HBHA (Hank's balanced salt solution supplemented with 20 inM Hepes, pH 7, 

15 0.05% BSA and 0.1% sodium azide). Subsequently tissue sections were incubated for 1 
h at room temperature with supernatant containing AP-hFDRG (alkaline phosphatase 
fused to the N-terminus of human FDRG, respectively) or AP at a concentration of , 
5nM. After the incubation, tissue sections were washed six times in HBHA, fixed in a 
solution containing 60% acetone, 3% formaldehyde and 20 mM Hepes, pH 7.5, washed 

20 three times in HBS (20 mM Hepes, pH 7.5, 150 mM NaCl) and heated for 30 min at 

65°C to inactivate endogenous alkaline phosphatase activity. Bound AP fusion protein 

was detected by developing sections in BCIP/NBT substrate solution ( 1 00 mM Tris- 

HC1, pH 9.5, 100 mM NaCl, 5 mM MgCl, 0.17 mg/ml BCIP and 0.33 mg/ml NBT). 

AP-hFDRG but not AP protein bound to the epidermis in both E 14.5 and P 1 .5 
■ * .... 

25 sections and to adipose tissue in P 1 .5 sections. Binding of APhFDRG but not AP was 

also observed to epidermis and adipose tissue of adult mice. These results suggest that 

* cells in epidermis and adipose tissue harbour binding sites for FDRG. 
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Example 4: Isolation and Characterization of Murine FDRG cDNAs 

A murine FDRG cDNA (also referred to herein as "BK89V) was cloned by a 
PCR-based subractive hybridization method. NIH3T3 fibroblasts were stably infected 
with either the virus carrying pBABE-PPARy or the virus carrying an empty pBABE 

5 vector. The two cell lines were treated with 1 0 pg/ml pioglitazone, a PPARyligand, for 
a period ofthree hours, and PolyA+RNA was isolated from the cells. The double 
stranded cDNA resulting from reverse transcription was then digested and used for 
multiple rounds of subtractive hybridization and PCR amplification based on a modified 
representational display analysis protocol. The subtracted library was then screened by 

10 slot blot analysis for differentially regulated genes, and candidates were confirmed by 
northern blotting. FDRG was one of the first ten or so clones to be isolated from this 
screen. : / • 

The full-lengthcione of murine FDRG was obtained from a 3T3-F442A 
adipocyte JtZAPII cDNA library. An open reading frame encoding 410 amino acid 

1 5 residues was identified. Hydrophobic sequences are present in the N-terminal regions of 
murine FDRG, consistent with secretory signal peptides. Sequence homology searches 
revealed that the C-terminal half of murine FDRG, displays strong similarity to a family 
of proteins sharing the so-called fibrinogen-like motif, such as angibpoietins- 1 ,2, 
fibrinogen a, p, and y chains, tenascin, ficolin, and pT49. The N-terminal half is 

20 predicted to form a coiled-coil domain. The murine FDRG protein also contains three 
potential N-glycosylation sites, and two cysteines that may be available for 
intramolecular disulfide bonding. * 

Example 5: Secretion of Murine FDRG From COS7 Cells 
25 Vectors encoding untagged and tagged murine FDRG were constructed as 

follows. Briefly, forward and reverse primers were synthesized and used to amplify the 
full-length open reading frame of FDRG from pBluescript SK(+/-) by PCR. 5' primers 
incorporate the native 5 ' untranslated region upstream of the initiator methionine and 3 ' 
primers include the nucleotide sequence encoding the YPYDVPDYA (hemagglutinin, 
30 HA) epitope (SEQ ID NO:26) followed by a termination codon. 
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For untagged FDRG, the following primers were used: 

5' primer - AATTAACCCTCACTAAAGGG (SEQ ID NO: 1 8) 
3' primer - ACGCGTCGACTAATACGACTCACTATAGGGCG (SEQ ID NO: 19) 
For tagged FDRG, the following primers were used: 
5 5' primer -AATTAACCCTCACTAAAGGG < 

3 r primer - TACGCGTCGACCTAAGCGTAGTCTGGGACGTC (SEQ ID NO:20) 
GTATGGGTAAGAGGCTGCTGTAGCCTCCAT (SEQ ID 1^0:21) 
Following amplification, The PCR products were cut with EcoRI/Sal I and were * 
cloned into EcoRI/XhoI cut pcDNA 3. 
10 ■ HA-tagged murine FDRG (in pcDNA) was expressed in COS7 cells using * 
transient transfection and detected the protein in the conditioned media by ' 
immunoblotting. To enhance the. visibility of the signal, the conditioned media was 
concentrated ten-fold before loading by centrifugation in a 1 OkDa MW cutoff spin V 
column (MSI). The epitope-tagged protein produced by the cells was observed to have a 
15 slightly higher molecular weight than the in v/rro translate, consistent with 
postradiational modifications such as glycosylation. 

Example 6: Generation and Characterization of Epitope-Tagged FDRG Proteins 
Expression of Epitope-Tagged human FDRG 

20 A human FDRG flag epitope-tagged protein (huFDRG:flag) vector ^was 

constructed by PCR followed by ligation into an' expression vector, pMET stop. The 
full-length open-reading frame was PCR amplified using a 5' primer incorporating a * 
Kozak sequence upstream of and including the initiator methionine and a 3' primer 
including the nucleotide sequence encoding the DYKDDDDK flag epitope (SEQ ID 

25 NO:23) followed by a termination codon. The primer sequences are shown below: 
5' primer: 5' TTTTTCAATTGACCGCCATGAGCGGTGCTCCGACG 3 ? (SEQ ID 

NO:24) : , 

3' primer: 5* 

TTTTTGTCGACTTATCACTTGTCGTCGTCGTCCTTGTAGTCGGAG 
30 GCTGCCTCT-GCTGC 3' (SEQ ID NO:25) 
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The sequenced DNA construct was transiently transfected into HEK 293T cells 
in 1 50 mm plates using Lipofectamine (GIBCO/BRL) according to the manufacturer's 
protocol. Seventy-two hours post-transfection, the serum-free conditioned medium 
(OptiMEM, Gibco/BRL) was harvested and spun. 
5 Conditioned medium was electroblotted onto a PVDF membrane (Novex) after 

: separation by SDS-PAGE on 4-20% gradient gels and probed with an M2 anti-flag 
polyclonal antibody (1:500, Sigma) followed by horseradish peroxidase (HRP) 
: conjugated sheep anti-mouse antibody (1 :5000, Amersham), developed using 
chemiluminescent reagents (Renaissance, Dupont), and exposed to autoradiography film 
10 (Biomax MR2 film, Kodak). Flag immunoreactivity appeared as a major band that 
migrated at an apparent molecular weight of 

42 kDa as determined by MultiMark molecular weight markers (Novex) by SDS- 
PAGE. 

15 Expression of Epitope-Tagged murine FDRG 

A murine HA-tagged FDRG expression construct was made as described in ^ 
Example 5. The DNA construct was sequenced and then transiently transfected into 
HEK 293T cells in 150 mm plates using Lipofectamine (GIBCO/BRL) according to the 
manufacturer's protocol. Seventy-two hours post-transfection, the serum-free 

20 conditioned medium (OptiMEM, Gibco/BRL) was harvested and spun. 

Conditioned medium was electroblotted onto PVDF (Novex) after separation by SDS- 
PAGE on 4-20% gradient gels and probed with an anti-HA monoclonal antibody, HA.l 1 
(1 :500i Babco) followed by HRP conjugated sheep anti-mouse antibody (1 :5000, 
Amersham), developed using chemiluminescent reagents (Renaissance, Dupont), and 

25 exposed to autoradiography film (Biomax MR2 film, Kodak). HA immunoreactivity 
appeared as a major band that migrated at 60 kDa as determined by MultiMark markers 
(Novex) by SDS-PAGE. 
A second fainter band was seen at 42 kDa. 
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Mock Transfected Supernatant 

A mock supernatant was prepared by transfecting 293T cells with a pMET stop 
vector containing no insert. Seventy-two hours post-transfection, the serum-free 
conditioned medium (OptiMEM, Gibco/BRL) was harvested and spun. Conditioned 
5 media was electroblotted, developed, and exposed as described above. No specific 
bands were detected on either anti-flag or anti-HA blots. 

Example 7: Characterization of anti-Human TANGO 115 Polyclonal Antibodies 

A synthetic peptide corresponding to the C-terminus of human TANGO 115 was 

10 prepared with the following sequence: KTWRGRYYPLQATTMLIQPMAA (SEQ ID 
NO:27); This was conjugated to KLH (keyhole limpet hemocyanin) at the N-terminal 
lysine and injected into two rabbits for the production of polyclonal antiserum using 
standard immunization protocols (e.g., Harlow and Lane, 1987). 

A synthetic peptide corresponding to the N-terminus of human TANGO 1 15 was 

15 prepared with the following sequence: PVQSKSPRFASWDEMN (SEQ ID NO:28). . 
This was synthesized with a MAP core (Research Genetics, Rockville MD) at the C- 
terminal asparagine. It was injected into two rabbits for the production of polyclonal 
antiserum using standard immunization protocols (e.g., Harlow and Lane, 1987). 
, /Antiserum samples were harvested 10 weeks after the initial immunization. 

20 Immurioreactivity was tested by ELISA against the immunizing peptide and, were 
positive as compared to pre-immune serum bleeds. Replicate samples of recombinant 
human FDRG:flag and mouse FDRG:HA protein supernatants and mock transfected 
negative control supernatants prepared as described above were run on SDS-PAGE and 
electroblotted onto PVDF membrane (Millipore). 1 : 1 000 dilutions of all four rabbit 

25 antisera were used on replicate blots as primary antiserum followed by anti-rabbit HRP 
conjugated secondary antibodies. Blots were developed using chemiluminescent 
reagents (Renaissance, Dupont), and exposed to autoradiography film (Biomax MR2 
film, Kodak). 

Immunospecificity of the rabbit antisera were tested by standard Western blot , 
30 methods. Antisera from the two rabbits immunized with the C-terminal peptide labeled 
a band in lanes containing human FDRG flag tagged at the C-terminus that was also 
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immunoreactive with anti-flag M2 antibodies in replicate blots. The approximate 
molecular weight of this band was 40 (+/-5) kDa as determined using MultiMark 
standards (Novex). This band was not seen in lanes containing supernatant from mock 
' or mouse FDRG HA transfected cells. 
5 Antisera from rabbits immunized with the N-terminal peptide did not show any 

specific bands in lanes containing either human or mouse FDRG protein when compared 
to lanes containing mock transfected samples, indicating that by Western blot analysis 
these antibodies do not interact with FDRG protein. 

10 Example 8: Tissue Expression of Murine FDRG 

Total RNA was isolated from mouse liver, muscle, heart, spleen, white adipose, 
brown adipose, kidney, testes, brain, and lung tissue samples of 9 week old male 
C57BKFJ mice and analyzed for FDRG expression by northern blotting. The tissue 
distribution of FDRG mRNA is highly adipose tissue-selective, with at least 10-20 fold 
1 5 higher expression in white and brown fat over other tissues examined. Low levels of 
. «- expression were detectable in other tissues (e.g., liver) and is believed to be due to fat 
. . contamination. / 

Example 9: In vitro Regulation of murine FDRG in PPARy-E xpressing Cells 

20 NIH3T3 fibroblasts stably expressing PPARy were treated with 1 0 jig/ml 

pioglitazone for 0, 2, and 4 hours. Northern blot analysis performed on the cellular 
RNA shows that FDRG mRNA is only barely detectable in untreated cells but 
undergoes at least a 1 0-fold increase within two hours after pioglitazone administration. 
Inhibition of protein synthesis by pre-treatment with 5 jig/ml cycloheximide does not 

25 block this induction. Cycloheximide by itself causes a rise in FDRG mRNA, a 
t. * < . 

phenomenon often associated with immediate early genes and generally attributed'to 
« * ' 
enhanced mRNA stability. The increases in FDRG message levels produced by 

pioglitazone and cycloheximide are additive, presumably reflecting independent - 

'* mechanisms of regulation. 
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Example 10: In vivo Regulation of murine FDRG in Zucker Diabetic Fatty Rats 

Male Zucker diabetic fatty (fa/fa) rats were treated with 6 mg/kg/day troglitazone 
admixed with food starting at 6 weeks of age and continuing until 26 weeks. By this 
time untreated littermates develop severe hyperglycemia and failure of the pancreatic P- 
5 cell function, while the treated animals remain normoglycemic. Northern blots were 
performed on RNA isolated from white adipose tissue of treated and untreated animals, 
five from each group. FDRG expression in vivo was observed to be consistently 
upregulated 3-5 fold in the treated animals compared to the controls. Glut4 mRNA was 
also found to be elevated, in agreement with published reports by other groups. 

1 0 Expression of a control mRNA, 36B4, was unaffected by troglitazone treatment.. 

.._* ... 

Example 11: Regulation of FDRG in Murine Models of Obesity 

FDRG expression was exaniined in white and brown fat tissues from 9 week old 
male obese (ob/ob) and diabetic (db/db) mice. Northern analysis showed uniform 

1 5 elevation of FDRG mRNA in ob/ob and db/db animals compared to their respective 
congenic lean controls, both in white and brown fat. This may reflect a correlation of 
increased FDRG expression with obese states in general, raising the possibility that 
FDRG may contribute to some of the pathophysiological features of such states. 
Alternatively, FDRG mRNA levels may be regulated directly or indirectly by the 

20 genetic defects specific for these two models, i.e. leptin and the leptin receptor. 

Example 12: Regulation of murine FDRG by Nutrition and Leptin Administration 

Lean mice were divided into three experimental groups, and were either allowed 
free access to food or subjected to 12 hours of fasting or 7.5 hours of fasting followed by 
25 refeeding (4.5 hours). The animals were sacrified at the end of 12 hours. Northern blot 
reveals a 2-3 fold increase in FDRG mRNA in white adipose tissue after a short-term 
fast, which is abrogated by refeeding. This suggests that FDRG levels are responsive to 
acute metabolic changes in vivo. 

In a second study, 1 0 week old lean or ob/ob mice were divided into three 
30 groups. The first group was the control, intraperitoneally injected with 1 .5 ml PBS 
twice daily. The second group was injected with 12.5 mg/kg leptin in PBS twice daily, 



10 
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with consequent decrease in dietary intake. The third group, the pair-fed group, was diet 
restricted to match the amount of food intake by the leptin-injected group, and was also 
injected with PBS. The animals were sacri'fiedat the end of one week. FDRG 
expression in brown fat was seen to go up substantially with restrictions of caloric 
intake, consistent with the findings in the first study, but this rise is not seen in leptin 
treated animals. That may reflect a fundamental difference in leptin-induced dietary 
restriction versus externally imposed restriction, or may be consistent with a further 
downstream effect, e.g., leptin-mediated suppression of FDRG upregulation that would 
ordinarily result from reduced food intake. « 



Example 13: Tissue Expression of human FDRG t . 

Northern blot analysis of RNA isolated from human placenta, heart, kidney, 
liver, and fat tissue samples showed high expression in white fat, in agreement with the 
murine FDRG tissue expression data, and in placenta. Placental expression is of interest 
1 5 because it is a tissue known to undergo extensive vascular remodelling in the adult (e.g. , 
in its development during pregnancy) and involved in transport of metabolic substrates 
for the fetus. Another fat-derived secreted protein with important metabolic and 
endocrine functions, leptin, is also produced in abundant quantities in placenta. 

20 Example 14: Expression of Recombinant FDRG Protein in Bacterial Cells 

FDRG can be expressed as a recombinant glutathione-S-transferase (GST) 
fusion polypeptide in E. coli and the fusion polypeptide can be isolated and 
characterized. Specifically, FDRG is fused to GST and this fusion polypeptide is 
expressedin £ coli, e.g., strain PEB199. Expression of the GST-FDRG fusion protein 
25 in PEB199 is induced with IPTG. The recombinant fusion polypeptide is purified, from 
crude bacterial lysates of the induced PEB199 strain by affinity chromatography on 
glutathione beads. Using polyacrylamide gel electrophoretic analysis of the 
polypeptide purified from the bacterial lysates, the molecular weight of the resultant 
fusion polypeptide. is determined. 



30 
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Example 15; Expression of Recombinant FDRG Protein in COS Cells 

To express the FDRG gene in COS cells, the pcDNA/Amp vector by Invitrogen 
Corporation (San Diego, CA) is used. This vector contains an SV40 origin of 
replication, an ampicillin resistance gene, an E. coli replication origin, a CMV promoter 
5 followed by a polylinker region, and an SV40 intron and polyadenylation site. A DNA 
fragment encoding the entire FDRG protein and a HA tag (Wilson et at, (1984) Cell 
37:767) fused in-frame to its 3' end of the fragment is cloned into the polylinker region 
of the vector, thereby placing the expression of the recombinant protein under the 
control of the CMV promoter. 

1 0 To construct the plasmid, the FDRG DNA sequence is amplified by PCR using 

two primers. The 5' primer contains the restriction site of interest followed by , 
approximately twenty nucleotides of the FDRG coding sequence starting from the 
initiation codon; the 3' end sequence contains complementary sequences to the other 
restriction site of interest, a translation stop codon, the HA tag and the last 20 

1 5 nucleotides of the FDRG coding sequence. The PCR amplified fragment and the 
pCDNA/Amp vector are digested with the appropriate restriction enzymes and the 
vector is dephosphoiylated using the CIAP enzyme (New England Biolabs, Beverly, 
MA). Preferably the two restriction sites chosen are different so that the FDRG gene is 
inserted in the correct orientation. The ligation mixture is transformed into £ coli cells 

20 (strains HB101, DH5a, SURE, available from Stratagene Cloning Systems, La Jolla, 
CA, can be used), the transformed culture is plated on ampicillin media plates, and 
resistant colonies are selected. Plasmid DNA is isolated from transformants and 
examined by restriction analysis for the presence of the correct fragment 

COS cells are subsequently transfected with the FDRG-pcDNA/Amp plasmid 

25 DNA using the calcium phosphate or, calcium chloride co-precipitation methods, DEAE- 
dextran-mediated transfection, lipofection, or electroporation. Other suitable methods 
for transfecting host cells can be found in Sambrook, J., Fritsh, E. F., and Maniatis, T. 
Molecular Cloning: A Laboratory Manual. 2nd, ed t Cold Spring Harbor Laboratory, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. The expression 

30 of the FDRG protein is detected by radiolabelling ( 35 S-methionine or 35 S-cysteine 
available from NEN, Boston, MA, can be used) and immunoprecipitation (Harlow, E. 
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and Lane, D. Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Gold Spring Harbor, NY, 1988) using an HA specific monoclonal antibody. Briefly, the 
cells are labeled for 8 hours with 35 S-methionine (or 35 S-cysteine). The culture media 
are then collected and the cells are lysed using detergents (RIPA buffer, 1 50 mM NaCl, 

5 1 % NP-40, 0.1% SDS, 0.5% DOC, 50 mM Tris, pH 7:5). Both the cell lysate and the 
culture media are precipitated with an HA specific monoclonal antibody. Precipitated 
proteins are then analyzed by SDS-PAGE. v 

Alternatively, DNA containing the FDRG coding sequence is cloned directly 
into the polylinker of the pCDNA/Amp vector using the appropriate restriction sites. 

10 The resulting plasmid is transfected into COS cells in the manner described above, and 
the expression of the FDRG protein is detected by radiolabelling and . 
immunoprecipitation using a FDRG specific monoclonal antibody 

Example 16: Retroviral Delivery of FDRG \ 

15 The entire open reading frame of FDRG can be subcloned into the retroviral 

■ ■ vector MSCVneo, described in Hawley et al (1994) Gene Therapy l:\36-Ufl. Cells 
- (293Ebna, Invitrogen) are then transiently transfected with the FDRG construct and with 
constructs containing viral regulatory elements, to produce high titre retrovirus 
containing the FDRG gene. The virus is then used to transfect mice. These mice are 
20 -then tested for any gross pathology and for changes in biological response, e.g., cell 
migration, using standard assays. 

• • ' i ".. ■• .. '. " 

- Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
25 * routine experimentation, many equivalents to the specific embodiments of . the invention 
' described herein. Such equivalents are intendeds be encompassed by the following 
claims. , 
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1 . An isolated nucleic acid molecule selected from the group consisting of: 

a) a nucleic acid molecule comprising a nucleotide sequence which is at least 
60% homologous to a nucleotide sequence of SEQ ID NO: 1 , SEQ ID NO:3, 
SEQIDNO:4,SEQIDNO:ll,SEQIDNO:13,SEQIDNO:14,theDNA 
insert of the plasmid deposited with ATCC as Accession Number 98632, or a 
complement thereof;' * 

b) a nucleic acid molecule comprising a fragment of at least 555 nucleotides of 
a nucleic acid comprising the nucleotide sequence of SEQ ID NO:l, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:l 1, SEQ ID NO:13, SEQ ID NO: 14, the 
DNA insert of the plasmid deposited with ATCC as Accession Number 
98632, or a complement thereof; 

c) a nucleic acid molecule which encodes a polypeptide comprising an amino 
acid sequence at least about 60% homologous to the amino acid sequence of 
SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:12, SEQ ID NO:15, or an amino 
acid sequence encoded by the DNA insert of the plasmid deposited with 
ATCC as Accession Number 98632; 

d) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ 
ID NO: 1 2, SEQ ID NO: 1 5, or the polypeptide encoded by the DNA insert of 
the plasmid deposited with ATCC as Accession Number 98632, wherein the 
fragment comprises at least 15 contiguous amino acid residues of the amino 
acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:12, SEQ ID 
NO: 15, or the polypeptide encoded by the DNA insert of the plasmid 
deposited with ATCC as Accession Number 98632; and . 

e) a nucleic acid molecule which encodes a naturally occurring allelic variant of 
a polypeptide comprising the amino acid sequence of SEQ ID NO: 2, SEQ 
ID NO:5, SEQ ID NO: 12, SEQ ID NO: 15, or an amino acid sequence 
encoded by the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98632, wherein the nucleic acid molecule hybridizes to a 
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nucleic acid molecule comprising SEQ ID NO: l, SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO: 1 1 , SEQ ID NO: 13. or SEQ ID NO: 1 4, under stringent 
conditions. 

5 2. The isolated nucleic acid molecule of claim 1 which is selected from the 

group consisting of: 

a) a nucleic acid molecule comprising the nucleotide sequence of SEQ ID 
NO: 1 , SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO: 1 1 , SEQ ID NO: 1 3, SEQ 
ID NO: 1 4, or the DNA insert of the plasmid deposited with ATCC as 

10 Accession Number 98632, or a complement thereof; and 

b) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:12, SEQ ID 
NO: 15, or an amino acid sequence encoded by the DNA insert of the plasmid 
deposited with ATCC as Accession Number 98632. 

15 

3. . The nucleic acid molecule of claim 1 further comprising vector nucleic 
acid sequences. 

4. The nucleic acid molecule of claim 1 further comprising nucleic acid 
20 sequences encoding a heterologous polypeptide. 

5. A host cell which contains the nucleic acid molecule of claim 1 . 

6. The host cell of claim 5 which is a mammalian host cell. 
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7. A non-human mammalian host cell containing the nucleic acid molecule 
of claim 1. 
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9 

8. An isolated polypeptide selected from the group consisting of: 

a) a fragment of a polypeptide comprising the amino acid sequence of SEQ ID 
NO:2, SEQ ID NO:5, SEQ ID NO: 12, SEQ ID NO: 1 5, or the polypeptide 
encoded by the DNA insert of the plasmid deposited with ATCC as 

5 * Accession Number 98632, wherein the fragment comprises at least 1 5 

contiguous amino acids of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:12, 
' ■ - * SEQ ID NO: 1 5, or the amino acid sequence encoded by the DNA insert of 

the plasmid deposited with ATCC as Accession Number 98632; 

b) a naturally occurring allelic variant of a polypeptide comprising the amino 
10 , acidsequenceofSEQIDNO:2,SEQIDNO:5,SEQIDNO:12,SEQID 

NO: 1 5, or an amino acid sequence encoded by the DNA insert of the plasmid 
" " J ■ ; deposited with ATCC as Accession Number 98632, wherein the polypeptide 
is encoded by a nucleic acid molecule which hybridizes to a nucleic acid 
molecule comprising SEQ ID-NO: 1 9 SEQ ID NO:3, SEQ ID NO:4, SEQ ID 
15 NO: 11, SEQ ID NO: 13, SEQ ID NO: 14, under stringent conditions; and 

c) a polypeptide which is encoded by a nucleic acid molecule comprising a 
nucleotide sequence .which is at least 60% homologous to a nucleic acid 
comprising the nucleotide sequence of SEQ ID NO: 1 , SEQ ID NO:3, SEQ 
IDNO:4,SEQIDNO:tl,SEQIDNO:13,SEQIDNO:14,prtheDNA , 

20 insert of the plasmid deposited with ATCC as Accession Number 98632. 

d) a polypeptide comprising an amino acid sequence which is at least 60% 
homologous to the amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, 
SEQ ID NO: 12, SEQ ID NO:15, or the polypeptide encoded by the DNA 

, * insert of the plasmid deposited with ATCC as Accession Number 98632. 
25.'" . >. 

9. The isolated polypeptide of claim 8 comprising the amino acid sequence 
of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO: 12, SEQ ID NO: 1 5, or an amino acid 
sequence encoded by the DNA insert of the plasmid deposited with ATCC as Accession 

30 Number 98632. 
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10. The polypeptide of claim 8 further comprising heterologous amino acid 
sequences. 

11. An antibody which selectively binds to a polypeptide of claim 8. 

5 

12. A method for producing a polypeptide selected from the group consisting 

of: _ • • 5 . 

a) a polypeptide comprising the amino acid sequence of SEQ ID NO: 2, SEQ 
ID NO:5, SEQ ID NO: 12, SEQ ID NO:15, or an amino acid sequence • 

10 encoded by the DNA insert of the plasmid deposited with ATCC as 

. Accession Number 98632; 

b) a fragment of a polypeptide comprising the amino acid sequence of SEQ ID ' 
NO:2, SEQ ID NO:5, SEQ ID NO:12 or SEQ ID NO: 1 5, or an amino acid 
sequence encoded by the DNA insert of the plasmid deposited with ATCC as 

15 Accession Number 98632 wherein the fragment comprises at least 15 \ 

contiguous amino acids of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:12, 
SEQ ID NO: 15, or the amino acid sequence encoded by the DNA insert of 
the plasmid deposited with ATCC as Accession Number 98632; and ..... 

c) a naturally oMuiringalldicvariart 

20 acidsequenceofSEQIDNO:2,SEQIDNO:5,SEQIDNO:12,SEQID 

NO:15, or an amino acid sequence encoded by the DNA insert of the plasmid 
deposited with ATCC as Accession Number 98632, wherein the polypeptide 
is encoded by a nucleic acid molecule which hybridizes to a nucleic acid 
molecule comprising SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID 

25 I NO:ll, SEQ ID NO:13, or SEQ ID NO:14, under stringent conditions; 

comprising culturing the host cell of claim 5 under conditions in which the nucleic acid 
molecule is expressed. 
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13 . A method for detecting the presence of a polypeptide of claim 8 in a 
sample comprising: 

a) contacting the sample with a compound which selectively binds to the 
polypeptide; and 

5 b) determining whether the compound binds to the polypeptide in the sample to 

thereby detect the presence of a polypeptide of claim 8 in the sample. 

14. The method of claim 13, wherein the compound which binds to the 
polypeptide is an antibody. 

10 

15. A kit comprising a compound which selectively binds to a polypeptide of 
claim 8 and instructions for use. 

16. A method for detecting the presence of a nucleic acid molecule in claim 1 

15 in a sample comprising: 

a) contacting the sample with a nucleic acid probe or primer which selectively 

hybridizes to the nucleic acid molecule; and 

b) determining whether the nucleic acid probe or primer binds to a nucleic acid 
molecule in the sample to thereby detect the presence of a nucleic acid 

20 molecule of claim 1 in the sample. * 

17. The method of claim 16, wherein the sample comprises mRN A 
molecules and is contacted with a nucleic acid probe. 

25 18. A kit comprising a compound which selectively hybridizes to a nucleic 

acid molecule of claim 1 and instructions for use. 
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19. A method for identifying a compound which binds to a polypeptide of 

claim 8 comprising: ' \ 

a) contacting the polypeptide, or a cell expressing the polypeptide with a 

test compound; and 

5 : b) determining whether the polypeptide binds to the test compound. 

20. The method of claim 1 9, wherein the binding of the test compound to the 
polypeptide is detected by a method selected from the group consisting of: 

a) ' detection of binding by direct detection of test compound/polypeptide 
10 binding; 

b) detection of binding using a competition binding assay; and 

c) detection of binding using an assay for FDRG activity. 

21. A method of modulating the activity of a polypeptide of claim 8 
15 comprising contacting the polypeptide or a cell expressing the polypeptide with a 

compound which binds to the polypeptide in a sufficient concentration to modulate the 
activity of the polypeptide. 

22; A method for identifying a compound which modulates the activity of a 
20 polypeptide of claim 8 comprising: 

a) contacting a polypeptide of claim 8 with a test compound; and 

b) determining the effect of the test compound on the activity of the polypeptide 
to thereby identify a compound which modulates the activity of the 
polypeptide. 
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23. ' A method of treating obesity or diabetes in a subject comprising 
administering to the subject an agent selected from the group consisting of a small 
molecule modulator of FDRG, a FDRG nucleic acid molecule, and a FDRG antibody 
such that obesity or diabetes in the subject is treated. 
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24. A method for diagnosing obesity or diabetes in a subject comprising: 

a) isolating a sample from the subject ; 

b) contacting the sample with a nucleic acid probe or primer which selectively 
hybridizes to a FDRG nucleic acid molecule; and 

5 c) determining whether the nucleic acid probe or primer binds to a FDRG 

nucleic acid molecule in the sample to thereby detect the presence of obesity 
or diabetes in the subject. * 

25. A method of for identifying a compound which modulates a metabolic 
10 disorder comprising: 

a) contacting a cell which expresses a FDRG receptor with a compound to form 
an assay mixture: 

b) contacting the assay mixture with FDRG; 

c) determining whether the compound modulates FDRG activity, 
15 such that a modulatory compound is identified. 

26. A method of regulating angiogenesis in a subject comprising 
administering to the subject an agent selected from the group consisting of a small 
molecule modulator of FDRG, a FDRG nucleic acid molecule, and a FDRG antibody 

20 such that angiogenesis in the subject is regulated. 
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Figure J A 

ACTOGACCCMCCGTCCGC^ 80 

«?roCTCOGCGTCTCC^ l5S 

MSBAPTAC AALML CAATA 18 

AAGAGO ATG AGC GGT OCT COS ACQ GCC GGO GCA GCC CTO ATO CTC TOC GCC CCC ACC GCC 219 



s 



38 



GTO CTA CTO AGC GCT CAG OGC GGA CCC GTO CAO TCC AAO TOO COO OGC TTT GCG TCC TGG 279 

n p M H V L A H 0 L It Q L G Q G L R B H; SB 

GAC GAG ATO AAT GTC CTG GCG CAC GGA CTC'CIO CAG CTC GGC CAG. GGG CTG CGC GAA CAC 339 

, ER fRSQI.SALERR L.8 A C O S 78 

GC* gL CGC ACC CGC AGT CAG CTG AGC GCG CTO GAG CGG CGC CTO AGC GCG TOC GGG TCC 399 

' ' _ _ _ - S T D L P L A P E S R V D 98 

OCC TOT CAO GGA ACC GAG GGG TCC ACC GAC CTC CCG TTA OCC CCT GAG AGO CGG GTO GAC 459 

„ '« v L H S L Q T Q L K A 0 N S R I 0 Q 1" 

CCT GAG GTC CTT CAC AGC CIO CAG ACA CAA CTC AAO GCT CAO AAC AGC AGO ATC CAO .CAA 519 

, p h K V A Q Q Q R ' H L B K Q K L R I Q 138 

CTC TTC CAC AAO OTG OCC- CAG CAG CAG CGG CAC CTO GAG AAO CAO CAC CTG CGA ATT CAG 579 

«.«ti.BR ,ICH1»DH BVA K 158 

voLPEMA 'QPVDPA H HV 178 

cctgwcgaacUaagaggciocmgagatg^ 

^CGCCTGcLcGOC^ 

o n G S P P F L V N C K M 218 

AGT GGA CTA TTT GAA ATC CAO CCT CAO GGO TCT CCO CCA TTT TTO GTO AAC TGC AAO ATG .19 

ACC TCA OAT GGA GGC TOG ACA OTA ATT CAG AGO CGC CAC GAT OGC TCA OTO GAC TTC AAC 879 

^CCC^GLGCCTAC^GaC^ "» 

CTO GAG AAO GTO CAT AGO ATC ACO GGG GAC CCC AAC AGC CGC CTO GCC OTG &0 CTO TOG 999 

GAC TGO GAT GGC AAC OCC GAG TTO CTO CM TCC TCC GTO CAC CTO GOT ©OC GAO GAC. ACO 1059 
„ ■ , n f T A P V A 0 Q L O A T T V P 318 

GCC TAT AGC CTG CAG CTC ACT GCA CCC GTG GCC OGC CAG CTO OGC GCC ACC ACC GTC CCA 1119 
e vPPSTH OQDHDL RRD 338 

c4:agcggcct^t!c<2tac«ttctccact^ 

K H C A K S L S G G H. H F 0 T_ _C_ _S_H_ _S_ ^H. Ut 



AAG AAC TGC GCC AAO AGC CTC TCT GGA OGC TOO TOO TTT OOC ACC TOC AGC CAT TCC AAC 1239 
CTC AAC CGC CAG TAC TTC CGC TCC ATC CCA CAG CAG CGG CAG AAO CTT AAO AAO GGA ATC 1299 
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Figure IB 

'* » K T W R C R Y Y P ■ L Q A T T M L I Q 398 
TTC TGG AAC ACC TGG CX3G GGC OGC TAC TAC CCG CTO CAG GCC ACC ACC ATG TTG ATC CAG 1359 

P M A A E A A S , • 
CCC ATC OCA GCA GAG GCA GCC TCC TAG. 

arrCCrGWCTCGGCCTGGTCCCATC 14 5 S 

TCXXSCAGGGGCTOMGGAGGGGCCATCTO^ 1S44 

GAGTOCAGGGGGGCTCCATCCGTTCCCTCCTGAGATCGAGGCTC^ 162 3 

GCKXATCXMGCTraCTCCTO x 70 2 

ATGGCGGACTCAGTCACATTGACTGACXX3GGACCAGGG CITO 1781 

GTOTGTAGGTCCCCTGGGGACACAAGCAGGCG CCAATGGTATCTGGGCGGAG CTCACAGAGTTCTTGGAATAAA I860 

CCTGAGAACAAAAAAAAAAAAAAAGGGGGGCGGC ' • X894 
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Figure 2 



O p V Q S K S P R P A S W 13 

GGA CCC GTG CAG TCC AAG TCG CCG CGC TTT GOG TCC TGG 39 

D E M N V L A H G >L L Q . , L G .Q .. .. G L R E H 33 

GAC GAG ATG AAT GTC CTG GCG CAC GGA CTC CTG CAG CTC GGC CAG GGG CTG CGC GAA CAC 99 

A E R T R S Q I* S A, L E R R 1* S A C G S .53 

GCG GAG CGC ACC CGC ACT CAG CTG AGC GCG CTG GAG CGG CGC CTG AGC GOG TGC GGG TCC 159 

A C Q G TEGS T O L P! t A P E S R V D 73 

GCC TGT CAG GGA ACC GAG GGG TCC ACC GAC CTC CCG TTA GCC CCT GAG AGC CGG GTG GAC 219 

p E v L H S LQT Q L K AQNS R I QQ 93 

CCT GAG GTC CTT CAC AGC CTG CAG ACA CAA CTC AAG GCT CAG AAC AGC AGG ATC CAG CAA , 279 

L F H K V- A 0 Q Q R H L E K Q H I* R I Q 113 

CTC TTC CAC AAG GTG GCC CAG CAG CAG CGG CAC CTG GAG AAG CAG CAC CTG CGA ATT CAG 339 

H L Q S Q F O L L D H.K H L D H E V A K 133 

CAT CTG CAA AGC CAG TTT GGC CTC CTG GAC CAC AAG CAC CTA GAC CAT GAG GTG GCC AAG 399 

PARRKRLP'EMAQPV D PAH NV 153 

CCT GCC CGA AGA AAG AGG CTG CCC GAG ATG GCC CAG CCA GTT GAC CCG GCT CAC AAT GTC 459 

SRLHR LP RDCQB L FQVGERQ 173 

AGC CGC CTG CAC CGG CTG CCC AGG GAT TGC CAG GAG CTG TTC CAG GTT GGG GAG AGG CAG 519 

S GLFBIQPQGS PPFL VN CKM 193 

ACT GGA CTA TTT GAA ATC CAG CCT CAG GGG.TCT CCG CCA TTT TTO GTG AAC TGC AAG ATG 579 



T S D O Q 



W T V I Q R R H D G S V D F K 213 



639 



ACC TCA GAT GGA GGC TGG ACA GTA ATT CAG AGG CGC CAC GAT GGC TCA GTG GAC TTC AAC 

RPHE A YKA GFGD PHGEFWLG 233 

CGG CCC TGG GAA GCC TAC AAG GCG GGG TTT GGG GAT CCC CAC GGC GAG TTC TGG CTG GGT 699 

LE KVHSITG DRMSRLAVQL R 253 

CTG GAG AAG GTG CAT AGC ATC ACQ GGG GAC CGC AAC AGC CGC CTG GCC GTG CAG CTG CGG 759 

D W D G N AE L LQ F S VHLG G B DT 273 

GAC TOG GAT GGC AAC GCC GAG TTC CTG CAG TTC TCC GTG CAC CTG GGT GCC GAG GAC ACG 819 

A Y S L Q L T A P V A G O h G A T T V P 293 

GCC TAT AGC CTG CAG CTC ACT GCA CCC GTG GCC GGC CAG CTG GGC GCC ACC ACC GTC CCA 879 

P SGLS VPFSTM DQDKDLRRD 313 

CCC AGC GGC CTC TCC GTA CCC TTC TCC ACT TGG GAC CAG GAT CAC GAC CTC CGC AGG GAC 939 

KMC A K S h S G G W W F G T C S H S N 333 

AAG AAC TGC GCC AAG AGC CTC TCT GGA GGC TGG TGG TTT GGC ACC TGC AGC CAT TCC AAC 999 



353 
1059 



L NGQYFRS I P QQ RQKLKKGI 
CTC AAC GGC CAG TAC TTC CGC TCC ATC CCA CAG CAG CGG CAG AAG CTT AAG AAG GGA ATC 

F w K T H R CRY Y P L Q A T T M L X Q 373 
TTC TGG AAG ACC TOO CGG GGC CGC TAC TAC CCG CTG CAG GCC ACC ACC ATG TTG ATC CAG 1119 



PMAA EAAS 
CCC ATG GCA GCA GAG GCA GCC TCC 



3B1 
1143 
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Alignment of: 






End 


Sequence type 


Sequence 


Start 


human FDRG 


(1 


> 


406) 


PROTEIN 


murine FDRG 


(1 


> 


410) 


PROTEIN 


hu angiopoeitin-2 


(l 


> 


496) 


PROTEIN 


mu angiopoeitin-2 


(1 


> 


496) 


PROTEIN 


por ficolin 


(1 


> 


326) 


PROTEIN 



human FORG 
murine FDRG 
hu angiopoeitin-2 
mu angiopoeitin-2 
por ficolin 



human FDRG 
murine FDRG 
hu angiopoeitin-2 
mu angiopoeitin-2 
por ficolin 



human FDRG 
murine FDRG 
hu angiopoietin-2 
mu angiopoietin-2 
por ficolin 



human FDRG 
murine FDRG 
hu angiopoietin-2 
mu angiopoietin-2 
por ficolin 



human FDRG 
murine FDRG 
hu angiopoietin-2. 
mu angiopoietin-2 
por ficolin 



human FDRG 
murine FDRG 
hu angiopoietin-2 
mu angiopoietin-2 
por ficolin 



human FDRG 
murine FDRG 
hu angiopoietin-2 
mu angiopoietin-2 
por ficolin 



I 60 

MSGAP--TAGAAIMLCAATA VLLSAQGG PVQSK5 PRF 

MRCAP--TAGAALVLCAAXA : - GLLSAQGRPAQPEPPRF 

MWQI VFFTLSCDLVLAAAYNN FRKSMDS IGKKQ YQVQHGSCS YT FLLPEMDNCRSS SS P Y 
MWQIIFLTFGWDLVLASAYSNFRKSVDSTGRRQYQVQNGPCSYTFLLPETDSCRSSSSPY 
M ELSRVAV- ALGPTGQ « LLL F 

61 

ASW DEMNVLAHG LLQLGQGLREHAERTRSQL — SALERR 

ASW- DEMNLLAHG -LLQLGHGLREHVERTRGQL-- GALERR 

VSNAVQRDAPLEYDDSVQRI^VLENIMENNTQWIJ^KI^NYIQDNMKKEMVEIQQNAVQN 
MSNAVQRDAPLDYDDSVQRLQVLENILENNTQWIJ4KIXNYIQDNMKKEMVEIQQNVVQNQ 
LSFQ 



121 



180 

LSACGSACQGTEGSTDLPLAPESRV DPEVL— HSLQTQLKAQNSR 

MAACGNACQG PKGKDAP FKDS E DRVPEGQT PET L — QS LQTQLKAQNS K 

TAVMIEIGTNLI^QTAEQTRKLTDVEAQVLNQTTRLELQLLEH^ 
TAVMIEIGTSLLNQTMQTRKLTDVEAQVLNQTTRLEI*QLLQHSISTNKLEKQILDC3TSE 
TLAAQAADTCPEVKWG « ' LEGSDK 



181 



240 
-LD 
-LD 



IQQLFHKVA-QQQRRLEKQHLRIQHLQS --QFGLLDHKH 

IQQLFQKVA-QQQRYLSKQNLRIQNLQS QIDLLAPTH • 

INKLQDKNS FLEKKVLAMEDKHI IQLQS IKEEKDQLQVLVSKQNS I IEELEKKI VTATVN 
INKLQNKNS FLEQKVLDMEGKHSEQLQSMKEQKDELQVLVSKQSSVI DELEKKLVTATVN 
LSILRG CPGLPGAAGPKGEAGANGPKGERGSPGWGKAGPAGPK 



241 



* 300 

HEVAKPARRKRLPEKAQPVDPAHNVSRLHRXP" >---R DCQBZx w'<*IMQSGXJEI 

NGVDKTSRGKRLPKMTQLIGLTPNATHLHRPP RDCQBXF&GEt^GLStt 

NSVW>KQQHDI*CTV^ 

GDRGEKGARGEKGEPGQLQSCATGP BTCKELLTBGBFLSGWBTI 

R C E G G 

301 * 360 

Q-2QGSPBFLVRCtoi-TSDGGmVIQBBEDGS 

C M GGWT Q R GSVDF W YK GFG E WLG 

361 420 

ESIIXSDRWSRlJlVDIOTWDGNftK^ 

BSXTGimGSQLXVQLQEmcmKL^ 

T L LD GH E 
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Figure 4 A 



79 



M R 

GCTGCTGGGTCTTGAACTCCTG^XTITCCGGAGTCCTAGCG ATG CGC 



2 

154 



CAPTAG AALV XCAATAGLLS 22 

TGC GCT CCG ACA GCA GGC GCT GCC CTG GTG CTA TGC GCG GCT ACT GCG GGG CTT TTG AGC 214 

A Q G R ' P A j Q P £ P P R F A S W 0 E H N 42 

GCG CAA GGG CGC CCT GCA CAG CCA GAG CCA CCG CGC TTT GCA TCC TGG GAC GAG ATG AAC 274 



L L A H G L L , Q L G H G L R E K V E R T 
TTG CTG GCT CAC GGG CTG CTA CAG CTC GGC CAT GGG CTG CGC GAA CAC GTG GAG CGC ACC 



62 
334 



RGQLGA L'ER RMAAC GNACQG 82 

CGT GGG CAG CTG GGC GCG CTG GAG CGC CGC ATG GCT GCC TGT GGT AAC GCT TGT CAG GGG 394 

P K G K D A P F K D S E D R V P E G Q T 102 

CCC AAG GGA AAA GAT GCA CCC TTC AAA GAC TCC GAG GAT AGA GTC CCT GAA GGC CAG ACT 4S4 



P E T L Q S L Q T Q L K A Q N S K I Q 0 122 

CCT GAG ACT CTG CAG ACT TTG CAG ACT CAG CTC AAG GCT CAA AAC AGC AAG ATC CAG CAA 514 

LFQ KVAQ'OQRYLSKQNLRXQ 142 

TTG TTC CAG > AAG GTG GCC CAG ■ CAG CAG AGA TAC CTA TCA AAG CAG AAT CTG AGA ATA CAG 574 

N L Q S Q I D L" L A P. T H I D N G V 0 K 162 

AAT CTT CAG AGC CAG ATA GAC CTC TTG GCC CCC ACG CAC CTA GAC AAT GGA GTA GAC AAG 634 

TSRGXRL. PKMTQLXGLTPNA 182 

ACT TCG AGG GGA AAG AGG CTT CCC AAG ATG ACC CAG CTC ATT GGC TTG ACT CCC AAC GCC 694 

T H LHR P'P R DCQELFQEGERH 202 

. ACC CAC TTA CAC AGG CCG CCC CGG GAC TGC CAG GAA CTC TTC CAA GAA GGG GAG CGG CAC 754 

SGLF QZQ PLGS PPFLVN CEM 222 

ACT GGA CTT TTC CAG ATC CAG CCT CTG GGG TCT CCA CCA TTT TTG GTC AAC TGT GAG ATG 814 



T S O G G W T V I Q R R L N G S V 0 F N 
ACT TCA GAT GGA GGC TGG 'ACA GTG ATT CAG AGA CGC CTG AAC GGC TCP GTG GAC TTC AAC 



LN CAK S L S.G GWW FGTCS KS N 
CTT AAC TGT GCC AAG AGC CTC TCT GGT GGC TGG TGG . TTT GGT ACC TGT AGC CAT TCC AAT 



242 
874 



OS W E A Y K D G FG 0 P Q G E FN L G 262 

CAG TCC TGG GAA GCC TAC AAG GAT GGC TTC GGA GAT CCC CAA GGC GAG TTC TGG CTG GGC 934 

LEKMHS I T GNRGSQLAVQLQ 282 

CTG GAA AAG ATG CAC AGC ATC ACA GGG AAC CGA GGA AGC CAA TTG GCT GTG CAG CTC CAG 994 

Off DGN A K t LQF P I H LGGEOT 302 

GAC TGG GAT GGC AAT GCC AAA TTG CTC CAA TTT CCC ATC CAT TTG GGG GGT GAG GAC ACA 1054 

A Y S L Q L T E P T A N E L G A T M V S 322 

GCC TAC AGC CTG CAG CTC ACT GAG CCC ACG GCC AAT GAG CTG GGT CCC ACC AAT GTT TCC 1114 

PNG LS L P FSTWOQDH 0 LRG D 342 

CCC AAT GGC CTT TCC CTC CCC TTC TCT ACT TGG GAC CAA GAC CAT GAC CTC CGT GGG GAC 1174 



362 
1234 



LNGQY F'HS IPRQRQ ERKKGI 382 
CTC AAT CGA CAA TAC TTC CAC TCT ATC CCA CGG CAA CGG CAG GAG CGT AAA AAG GGT ATC 1294 
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Figure 4 B _ ^ 

. . .. ,. . . -,. ... ... •, . i ~- . •> ■ o • . ;i ..; , . ,.. ... >;..... ., . , -v 

F W K T W K G R Y Y P L* Q A T T L L I Q 402 

TTC TGG AAA ACA TGG AAC GGC CGC TAG TAT CCT CTG CAG GCT ACC ACC CTG CTG ATC CAG 1354 

P M E A T A A S > > 411 

CCC ATG GAG GCT ACA GCA GCC TCT TAG r 1381 

CCTCCTCACTGGAGCCTGGTTCCAGGCCT^^ H60 

GGGGGCAGGAGCTCTAAGTAGGGCTATCTGCGTCTTGTC 1539 

CTTTTCCGTGTTGGGGTCTGCAAGCATTGTTGTCTGAAACACT 1618 

AOVTGGGCTCGAGGGGCACTGAATATC^CTTCTCGCCrrA .1697 

CTAGCTGGGCCCTTAATGGCGGACTCAGTCATATTGACTGACTG I 776 

GTGCTGTTGTAGGTGCTGTGGATGCACAGGTGCTAACTGTGGTTCCCAG WS 

CAACCTCAGAACAAAAAAAAAAAAAAAAGGGCGGCCGC ' 1893 



WO 99/45135 

Figure 5 



PCT/US99/04548 

8/13 



q G RPAQ PEPPRFASW DE M N .19 
CAA GGG CGC CCT GCA CAG CCA GAG CCA CCG CGC TTT GCA TCC TGG GAC GAG ATG AAC 57 



L 
TTG 


, L 
CTG 


AHGLLQLGHGL R. E HvV E R T 
GCT CAC GGG CTG CTA CAG CTC GGC CAT GGG CTG CGC GAA CAC GTG GAG CGC ACC 


39 

"7 


t' 
R 
CGT 


G Q L G A L E R RM AA C G N-A C QG 
GGG CAG CTG GGC GCG CTG GAG CGC CGC ATG GCT GCC TGT GGT AAC GCT TGT CAG, GGG 


59 
177 


P 

CCC 


K 
AAG 


gkdapfk'dsedrv p eg qt 
gga aaa gat gca ccc ttc aaa gac tcc gag gat aga gtc cct gaa ggc cag act 


79 
237 


p 

CCT 


£ 
GAG 


TLQSLQTQLK AQNS KIQQ 
ACT CTG CAG AGT TTG CAG ACT CAG CTC AAG GCT CAA AAC AGC AAG ATC CAG CAA 


99 
297 


L 
TTG 


F 
TTC 


q K V A Q Q Q R Y L S'K Q N' L R I Q ' 
CAG AAG GTG GCC CAG CAG CAG AGA TAC CTA TCA AAG CAG AAT CTG AGA ATA CAG 


119 

357. 


N 
A AT 


L 
CTT 


q' S Q I D L L A P T H L D N G V D K 
CAG AGC CAG ATA GAC CTC TTG GCC CCC ACG CAC CTA GAC AAT >GGA GTA GAC AAG 


139 
417 


T 
ACT 


S 
TCG 


RGK R LPKMTQLIG L:*" T. P N A 
AGG GGA AAG AGG CTT CCC AAG ATG ACC CAG CTC ATT GGC TTG ACT CCC AAC GCC 


159 
477 


T 
ACC 


H 
CAC 


lhrpprdcqel fq e gerh 

TTA CAC AGG CCG CCC CGG GAC TGC CAG GAA CTC TTC CAA GAA GGG GAG CGG CAC 


179 

537 u 


S G L F Q I Q P L G S P P F L V , N C E M 
AGT GGA CTT TTC CAG ATC CAG CCT CTG GGG TCT CCA CCA TTT TTG GTC.jjAC TGT GAG ATG 


199 
597 


T S D G G W T.V.I Q R. R L N G S. V D F N 
ACT TCA GAT GGA GGC TGG ACA GTG ATT CAG AGA CGC CTG AAC GGC TCT GTG GAC TTC AAC 


219 
657 


CAG 


S 
TCC 


WEAYKDGFGDPQGE FW LG 
TGG GAA GCC TAC AAG GAT GGC TTC GGA GAT CCC CAA GGC GAG TTC TGG CTG GGC 


239 
717 


L E K M H • S I *T G N R 6 S Q L A V Q L Q. 
CTG GAA AAG ATG CAC AGC ATC ACA GGG AAC CGA GGA AGC CAA TTG GCT GTG CAG CTC CAG 


259 
777 


0 
GAC 


w 

TGG 


D G N A K L L Q F P I H L-G G E D T 
GAT GGC AAT GCC AAA TTG CTC CAA TTT CCC ATC CAT TTG GGG GGT GAG GAC ACA 


279 
837 


A 
GCC 


Y 
TAC 


SLQLTEPTA N ELGA TNVS 
AGC CTG CAG CTC ACT GAG CCC ACG GCC AAT GAG CTG GGT GCC ACC AAT GTT TCC 


299 
897 


P' 
CCC 


N 
AAT 


G L S L P F S T W D Q D H D L R • G D 
GGC CTT TCC CTG CCC TTC TCT ACT TGG GAC CAA GAC CAT GAC CTC CGT GGG GAC 


319 
957 


L 
CTT 


N 
AAC 


c A K S L S G G W W F ' G T C jS H S N 
TGT GCC AAG AGC CTC TCT GGT. GGC TGG TGG TTT GGT ACC TGT AGC CAT TCC AAT 


339 
1017 


L N G Q Y F H S I P R Q R Q E R K K G I 
CTC AAT GGA CAA TAC TTC CAC TCT ATC CCA CGG CAA CGG CAG GAG CGT AAA AAG GGT ATC 


359 
1077 


F ' 
TTC 


W 
TGG 


KTWKGRY YP LQATT.L LI.Q 
AAA ACA TGG AAG GGC CGC TAC TAT CCT CTG CAG GCT ACC ACC CTG CTG ATC CAG 


379 
1137- 


P 

CCC 


M 
ATG 


E A T A A S 
GAG GCT ACA GCA GCC TCT 


387 
1161 
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10 20 30 40 50 
inputs GTCGACCCACGCGTCCGGCTC-CAGATCT-TCTTCTGCACCAG AGCAA-GTCTAAGT 

GTCGACCCACGCGTCCGCCTCACACGACTGTGATCCGATTCTTTCCAGCGGCTTCTGCAA 
10 20 30 ' 40 50 60 

60 * . 70 80 90 100, 

. inputs CTGAGCCGG-CT — CCCCCAGAACTCCA — GCTGCTGG GTCTTGAACTCCTGCG 

. . : : : : . : ' s • • ■ • • • • • • '*.• ■■• ■ • 

CCAAGCGGGTCTTACCCCCGGTCCTCCGCGTCTCCAGTCCTCGCACCTGGAACCCCAACG 

70 ... 80 90, 100 110 120 

HO 120 130 140 150 160 

inputs TTCC — GGAGTCCTAGCGTTGCTGCACCCAAGGCCACCCCCAGAATCATGCGCTGCGCTC 

TCCCCGAGA'GTCCCCGAATCCCCGCTCCCAGG--CTACCTAAGAGG-ATGAGCGGTGCTC 
130 140 150 160 170 

170 180 190 200 210 ' 220 

inputs CGACAGCAGGCGCTGCCCTGGTGCTATGCGCGGCTACTGCGGGGCTTTTGAGCGCGCAAG 

: : : : ::::: :: - • I::::::::.: 

CGACGGCCGGGGCAGCCCTGATGCTCTGCGCCGCCACCGCCGTGCTACTGAGCGCTGAGG 

180 190 200 210 220 230 

230 ' 240. 250 .260 270 280 

inputs GGCGCCCTGCACAGCCAGAGCCACCGCGCTTTGCATCCTGGGACGAGATGAACTTGCTGG 

: .:: : :: : : : : : : t ' • '• :s:: 

GCGGACCCGTGCAGTCCAAGTCGCCGCGCTTTGCGTCCTGGGACGAGATGAATGTCCTGG 

240 250 260 270 280 290 

290 300 310 320 \ 330 340 

input s • CTCACGGGCTGCTACAGCTCGGCCATGGGCTGCGCGAACACGTGGAGCGCACCCGTGGGC 

.. :::::::::::::::: :::::::::::: .: : 

CGCACGGACTCCTGCAGCTCGGCCAGGGGCTGCGCGAACACGCGGAGCGCACCCGCAGTC 

300 -310 320 330 340 350 

350 360 370 380 390 400 

inputs AGCTGGGCGCGCTGGAGCGCCGCATGGCTGCCTGTGGTAACGCTTGTCAGGGGCCCAAGG 

..... :::::: : : : : : . : : : : : : • : : : : : • : : : 

AGCTGAGCGCGCTGGAGCGGCGCCTGAGCGCGTGCGGGTCCGCCTGTCAGGGAACCGAGG 

360 370 380. 390 400 410 

•■ 410 420 430 440 450 

inputs GAAAAGATGCACCCTTCAAAGACTCCGAGGATAGAGTCCCTGAAGGCCAGACT— CCTG 

. « • » • • • • • • • • * • 

GQT -GCACCGAC CTCCC— GTTAG— CCCCTGAGAGCCGGGTGGAGCCTG 

430 440 450 460 



420 



460 470 480 490 .. 500 510 

incuts AGACTCTGCAGAGTTTGCAGACTCAGCTCAAGGCTCAAAACAGCAAGATCCAGCAATTGT 

F .. :: :::::::: : • *, 

AGGTCCTTCACAGCCTGCAGACACAACTCAAGGCTCAGAACAGCAGGATCCAGCAACTCT 

480 490 S00 " 510 520 



470 
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520 530 540 550 560 570' 

inputs TCCAGAAGGTGGCCCAGCAGCAGAGATACCTATCAAAGCAGAATCTGAGAATACAGAATC 
:::: : ::: ::: 

, , TCCACAAGGTGGCCCAGCAGCAGCGGCACCTGGAGAAGCAGCACCTGCGAATTCAGCATC 
530 540 -550 560 570 580 

■ ,.580 590 600 610 620 .630 

inpu t s TTCAGAGCCAGATAGACCTCTTGGCCCCCACGCACCTAGACAATGGAGTAGACAAGACTT ■ 

TGCAAAGCCAGTTTGGCCTCCTGGACCACAAGCACCTAGACCATGAGGTGGCCAAGCCTG 
590 600 ' 610 , 620 630 . 640 

640 650 660 670 - . 680 690^ 

inputs CGAGGGGAAAGAGGCTTCCCAAGATGACCCAGCTCATTGGCTTGACTCCCAACGCCACCC 

CCCGAAGAAAGAGGCTGCCCGAGATGGCCCAGCCAGTTGACCCGGCTCAGAATGTCAGCC 
650 660 670 ; =680 . " 690 '700 

700 710 .720 730 740 V 750" 

inputs ACTTACACAGGCCGCCCCGGGACTGCCAGGAACTCTTCCAAGAAGGGGAGCGGCACAGTG 
. .... • ■ • . . . . • * • • •«•<•• . • ♦ • •■•* 

. . • a • • • • ■ • «»•• •••• ••••••••••• • » »,•■• •••••• 9* •• ••••••••• 

GCCTGCACCGGCTGCCCAGGGATTGCCAGGAGCTGTTCCAGGTTGGGGAGAGGCAGAGTG 
710 720 730 740 750 760 . 

7.60, 770.' 780 790. * " 800 810 / 

inputs GACTTTTCCAGATCCAGCCTCTGGGGTCTCCACCATTTTTGGTCAACTGTGAGATGACTT 

GACTATTTGAAATCCAGCCTCAGGGGTCTCCGCCATTTTTGGTGAACTGCAAGATGACCT 
770 780 790 800 810 / .820 

820 830 840 850 860 \ 870 

inputs CAGATGGAGGCTGGACAGTGATTCAGAGACGCCTGAACGGCTCTGTGGACTTCAACCAGT 

CAGATGGAGGCTGGACAGTAATTCAGAGGCGCCACGATGGCTCAGTGGACTTCAACCGGC 
830 840 850 860 .870 880 

880 890* 900 910 920 ' 930 

inputs CCTGGGAAGCCTACAAGGATGGCTTCGGAGATCCCCAAGGCGAGTTCTGGCTGGGCCTGG 

!!!!!!!•••!• ■• • • ' ••••••• • «..•.••• • 

, CCTGGGAAGCCTACAAGGCGGGGTTTGGGGATCCCCACGGCGAGTTCTGGCTGGGTCTGG 
890 900 - 910 ' 920 \ 930 940 

940 950 960 970 980 990 

inputs AAAAGATGCACAGCATCACAGGGAACCGAGGAAGCCAATTGGCTGTGCAGCTCCAGGACT 

AGAAGGTGCATAGCATCACGGGGGACCGCAACAGCCGCCTGGCCGTGCAGCTGCGGGACT ' 
950 960 970 \ 980 990 * 1000 

1000 1010 1020 1030 ' 1040 , ;1050 

inputs GGGATGGCAATGCCAAATTGCTCCAATTTCCCATCCATTTGGGGGGTGAGGACACAGCCT 

GGGATGGCAACGCCGAGTTGCTGCAGTTCTCCGTGCACCTGGGTGGCGAGGACACGGCCT 
.1010 1020 1030 1040 1050, 1060 
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1060 * 1070 1080 1090 1100 1110 ' 

inputs ACAGCCTGCAGCTCACTGAGCCCACGGCCAATGAGCTGGGTGCCACCAATGTTTCCCCCA 
. ::::::: ::::::: : : : : : : : 

ATAGCCTGCAGCTCACTGCACCCGTGGCCGGCCAG 

. 1070 1080 1090 1100 1110 ' 1120 

i. 

•1120 1130 • 1140 1150 1160' 1170 

inputs ATGGCCTTTCCCTGCCCTTCTCTACTTGGGACCAAGACCATGACCTCCGTGGGGACCTTA 

. . • • j ;;j;jt"js . I i I I t m • 

• ' G cGGCCTCtCCGTACCCTTCTCCACTTGGGACCAGGATCACGACCTCCGCAGGGACAAGA 
1130 ■ 1140' 1150: 1160 1170 1180 . 

1180. 1190 • 1200 .1210 1220 1230 

inputs ACTGTGCCAAGAGCCTCTCTGGTGGCTGGTGGTTTGGTACCTGTAGCCATTCCAATCTCA 

ACTGCGCCAAGAGCCTGTCTGGAGGCTGGTGGTTTGGCACCTGCAGCCATTCCAAC 
-:• U90 . ' 1200 1210 1220 1230 1240 

1240 ; 1250 .1260 1270 1280 1290 

inDUtS ATGGACAATACTTCCACTCTATCCCACGGCAACGGCAGGAGCGTAAAAAGGGTATCTTCT 

. .•••••;;;;.! J! • 

• . ACGGCCAGTACTTCCGCTCCATCCCACAGCAGCGGCAGAAGCTTAAGAAGGGAATCTTCT 

1250 1260 . 1270 1280 1290 1300 

1300 . 1310 • - 1320 1330 1340 1350 

input s GGAAAACATGGAAGGGCCGCTACTATCCTCTGCAGGCTACCACCCTGCTGATCCAGCCCA 

::::::::::: : : :::::::: ! i ■ 

'• GGAAGACCTGGCGGGGCCGCTACTACCCGCTGCAGGCCACCACCATGTTGATCCAGCCCA 

1310 1320 i330 1340 1350 1360 

1360 1370 •' 1380 1390 1400 1410 . 

inputs TGGAGGCTACAGCAGCCTCTTAGCCTCCTCACTGGAGCCTGGTTCCAGGCCTAAGAA-GA 

:::: :::: ::::::: : " '■■ 

' TGGCAGCAGAGGCAGCCTCCTAGCGTCCTGGCTGG-GCCTGGTCCCAGGCCCACGAAAGA 

1370 1380 1390 1400 1410 1420 

1420' - 1430** 1440 1450 1460 1470 

input S CAGTGACTTTGGtTGTGGCCCTGAGATTTGGCCATTCTCTGCTGGGGGCAGGAGCTC-TA ■ 

' CGGTGAckTTGGCTCTGCCCGAGGATGTGGCCGTTCCCTGCCTGGG-CAGGGGCTCCM 
1430 • .1440 1450 1460 1470 1480 

I486 1490 1500 1510 1520 1530 

inputs AGtAGGGCTATCTG-CGTCTTGTGGACAAAGAAGAAGCCCGTAACTGGAGAGACTGGAGG 



■ GGAGGGGCCATCTGGAAACTTGTGGACAGAGAAGAAGACCACGACTGGAGAAGC 

1490 ■ 1500 1510 1520 1530 

• 1540 ' ' 1550 • 1560 :' 1570 1580 1590 

inputs ACCCCTTTTCCGTGTT---GGGGTCTGCAAGCATTGTTGTCTGAAACAGTCAGAGCAACA 

-CCCCT.TT-CTGAGTGCAGGGGGGCTGCATGCGTTGCCTCCTGAGA --TCGAGGCTGCA 
"1540 1550 1560 1570 1580 
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1 1600 1610 1620 1630 1640 1650 

inputs GGAAACAAATGGCCCAGATCCAGAAAACATGGGCTCGAGGGGCACTGAATATCACTTCTC 
:::::: 

GGATA TGCTCAGACTCTAGAGGCGTGGAC-CAAGGGGCATGGAGCTTCACTCCTT 

1590 1600 1610 1620 1630 1640 

1660 . 1670 1680 1690 1700 1710 

inputs GCCTACCAGAGAAGTTGGGGATGCAGAGGGACCACTACAGTCCAACTAG-CTGGGCCCTT 
• • •••• «♦•••••'• sisiiss'j •••••• • * * I I • • * * • Z • J * • ■ 

GCTGGCCAGGGA-GTTGGGGACTCAGAGGGACCACTTGGGGCCAGCCAGACTGG — CCTC 
1650 * 1660 .1670 1680 1690 1700 

1720 1730 1740 1750 1760 1770 

inputs AATGGCGGACTCAGTCATATTGACTGACTGGAGACAGGG-TGCCAGGAGCCCTGGATACA 

AATGGCGGACTCAGTCACATTGACTGACGGGGACCAGGGCTTGTGTGGGTCGAGAGCGCC 
1710 1720 .1730 ■ 1740 . 1750. 1760 

1780 • 1790 1800 1810 

inputs C'TCATGGTGCTGTTGfAGGTGC-TGTGGATGCACAGGTG— -CTAACTG— TGGT 

CTCATGGTGCTGGTGCTGTTGTGTGTAGGTCCCCTGGGGACACAAGCAGGCGCGAATGGT 
1770 1780 1790 1800- 1810 1820 

1820 1830 1840 ' 1850 1860 187.0 

inputs TCCCAGGCACAGCTCACAGCATTCTTACAATAAAAACAACCTCAGAACAAAAAAAAAAAA 



ATCTGGGCGGAGCTCACAGAGTTCTTGGAATAAAAGCAACCTCAGAACAAAAAAAAAAAA 
1830 1840 1850 I860 1870 1880 

1880 1890 11 

inputs AAAAGGGCGGCCGC ' 

AAA-GGGCGGGCGC 
1890 
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inputs , MSGAPTAGAALMLCAATAVLLSAQGGPVQSKSPRFASWDEMNVLAHGLLQLGQGLREHAE 

• •••••«•• •••••• • • •••••••••• ••••••••• ••••• * 

• ••••••••••••••• • •• •••• ••••••••••••• 

MRCAPTAGAALVLCAATAGLLSAQGRPAQPEPPRFASWDEMNLLAHGLLQLGHGLREHVE 
10 20 30 40 50 60 



70 80 90 . 100 110 
inputs RTRSQLSALERRLSACGSACQGTEGSTDLPLAPESRV DPEVLHSLQTQLKAQNSRI 



RTRGQLGALERRMAACGNACQGPKGKDAPFKDSEDRVPEGQTPETLQSLQTQLKAQNSKI 
70 80 90 100 110 120 



120 130 140 150 160 170 

inputs QQLFHKVAQQQRHLEKQHLRIQHLQSQFGLLDHKHLDHEVAKPARRKRLPEMAQPVDPAH 

• Baa. a • a • a a a • • a • a • • • • •• a • a « • • a a a a • • 

QQLFQKVAQCWRYLSKQNLRIQNLQSQIDLIiAPTHLDNGVDKTSRGKRLPKMTQ 

130 140 150 160 . 170 180 



180 190 200 210 220 230 

inputs NVSRLHRLPRDCQELFQVGERQSGLFEIQPQGSPPFLVNCKMTSDGGWTVIQRRHDGSVD 

NATHLHRPPRDCQELFQEGERHSGLFQIQPLGSPPFLVNCEMTSDGGWTVIQRRLNGSVD 
190 200 210 220 230 240 

240 250 260 270. 280 290 

inputs FNRPWEAYKAGFGDPHGEFWLGLEKVHSITC^ 

TOQSWEAYKDGFGDPQG 

250 260 270 280 290 300 



300. 310 320 330 340 350 

inputs DTAYSLQLTAPVAGQLGATTVPPSGLSVPFSTWDQDHDLRRDKNCAKSLSGGWWFGTCSH 

DTAYSLQLTEPTANELGATNVSPNGLSLPFSTWDQDHDLRGDLN^ 

310 320 330 340 350 360 



360 370 380 390 400 

inputs SNLNGQYFRS I PQQRQKLKKGI FWKTWRGRYYP LQATTMLIQPMAAEAAS 

SNLNGQYFHSI PRQRQERKKGI FWKTWKGRY YPLQATTLLIQPMEATAAS 

370 380 390 400 410 
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SEQUENCE LISTING 

<110> Millennium Biotherapeutics, Inc. 

5 <120> NOVEL FDRG PROTEIN AND NUCLEIC ACID MOLECULES AND USES THEREFOR 

<130> MEI-015CPPC ' 

<140> 
10 <141> 

<150> 09/033,539 
<151> 1998-03-02 

15 <160> 27 

<170> Patent In Ver. 2.0 

<21Q> 1 

20 <211> 1894 , 
<212> DNA ( 
<213> Homo sapiens 1 

<220> 
25 <221> CDS 

. <222> (166) . . (1383) 



30 



60 



<400> 1 

agtcgaccca cgcgtccgcc tcacacgact gtgatccgat tctttccagc ggcttctgca 60 
accaagcggg tcttaccccc ggtcctccgc gtct'ccagtc ctcgcacctg gaaccccaac 120 



gtccccgaga gtccccgaat ccccgctccc aggctaccta. agagg atg age ggt get 177 

Met Ser Gly Ala 

35" * ' ' 1 

ccg acg gec ggg gca gec ctg atg etc tgc gee gee ace gee gtg eta 225 

Pro Thr Ala Gly Ala Ala Leu Met Leu Cys Ala Ala Thr Ala Val Leu 

5 10 * * 15 20 

40 

ctg age get cag ggc gga ccc gtg cag tec aag teg ccg cgc ttt gcg 273 

Leu Ser Ala Gin Gly Gly Pro Val Gin Ser Lys Ser Pro Arg Phe Ala 

25 30 ' 35 

45 tec tgg gac gag atg aat gtc ctg gcg cac gga etc ctg cag etc ggc 321 
Ser Trp Asp Glu Met Asn Val Leu Ala His Gly Leu Leu Gin Leu Gly 
40 45 50 

cag ggg ctg cgc gaa cac gcg gag cgc acc cgc agt cag*ctg age gcg 369 
50 Gin Gly Leu Arg Glu His Ala Glu Arg Thr Arg Ser Gin Leu Ser Ala 
55 60 65 

ctg gag egg cgc ctg age gcg tgc ggg tec gee tgt cag gga acc gag 417 
Leu Glu Arg Arg Leu Ser Ala Cys Gly Ser Ala Cys Gin Gly Thr Glu 
55 70 75 80 

ggt cca ccg acc etc ccg tta gee cct gag age egg gtg gac cct gag 465 
Gly Pro Pro Thr Leu Pro Leu Ala Pro Glu Ser Arg Val Asp Pro Glu 
85 90 95 , x0 ° 



gtc ctt cac age ctg cag aca caa etc aag get cagaac age agg ate 513 
Val Leu His Ser Leu Gin Thr Gin Leu Lys Ala Gin Asn Ser Arg lie 
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cag caa etc ttc cac aag gtg gec cag cag cag egg cac ctg gag aag 561 
Gin Gin Leu Phe His Lys Val Ala Gin Gin Gin Arg His Leu Glu Lys 
5 120 125 130 



cag cac ctg cga att cag cat ctg caa age cag ttt ggc etc ctg gac 
Gin His Leu Arg lie Gin His Leu Gin Ser Gin Phe Gly Leu Leu Asp 
135 140 145 

10 

cac aag cac eta gac cat gag gtg gec aag cct gee cga aga aag agg 
His Lys His Leu Asp His Glu Val Ala Lys Pro Ala Arg Arg Lys Arg 
150 155 * 160 

15 ctg ccc gag atg gec cag cca gtt gac ccg. get cac aat gtc age cgc 
' Leu Pro Glu Met Ala Gin Pro Val Asp Pro Ala His Asn Val Ser Arg 
. i 6 5 170 



20 Leu His Arg Leu Pro Arg Asp Cys Gin Glu. Leu Phe Gin Val Gly Glu 
185 190 195 

agg cag agt gga eta ttt gaa ate cag cct cag ggg tct ccg cca ttt 
Arg Gin Ser Gly Leu Phe Glu He Gin Pro Gin Gly Ser Pro Pro Phe 
25 200 205 210 



50 



55 ' ccc ttc tec act tgg gac cag gat cac gac etc cgc agg gac aag aac 
Pro Phe Ser Thr Trp Asp Gin Asp His Asp Leu Arg Arg Asp Lys Asn 
325 330 335 340 



609 



657 



705 



175 * 180 
ctq cac egg ctg ccc agg gat tgc cag gag ctg ttc cag gtt ggg gag 753 



801 



849 



945 



993 



ttg gtg aac tgc aag atg ace tea gat gga * ggc tgg aca gta att cag 
Leu Val Asn Cys Lys Met Thr Ser Asp Gly Gly Trp Thr Val lie Gin 
215 220 ^ 225 

30 

agg cgc cac gat ggc tea gtg gac .ttc aac. egg ccc. tgg gaa gee tac 897 
Arg Arg His Asp Gly Ser Val Asp Phe Asn Arg Pro Trp Glu Ala Tyr 
230 235 240 

35 aag gcg ggg ttt ggg gat ccc cac ggc, gag ttc tgg ctg ggt ctg gag 
Lys Ala Gly Phe Gly Asp Pro His Gly Glu Phe Trp Leu Gly Leu Glu 
245 250 255 260 

aag gtg cat age ate acg ggg gac cgc aac age -cgc ctg gec gtg cag 
40 Lys Val His Ser He Thr Gly Asp Arg Asn Ser Arg Leu Ala Val Gin 
265 270 k 275 

* ctg egg gac tgg gat ggc aac gec gag ttg ctg cag ttc tec gtg cac 1041 
Leu Arg Asp Trp Asp Gly Asn Ala Glu Leu Leu Gin Phe Ser Val His 
45 280 285 290 

ctg ggt ggc gag gac acg gee tat age ctg cag etc. act gca ccc gtg 1089 
Leu Gly Gly Glu Asp Thr Ala Tyr Ser Leu Gin Leu Thr Ala Pro Val 
295 300 , 305 

gee ggc cag ctg ggc gee ace ace gtc cca ccc age ggc etc tec gta 1137 
Ala Gly Gin Leu Gly Ala Thr Thr Val Pro Pro Ser Gly Leu Ser Val 
310 315 . , 320 



1185 



60 



1233 
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tec aac etc aac ggc cag tac ttc cgc tec ate cca cag cag egg cag 1281 
Ser Asn Leu Asn Gly Gin Tyr Phe Arg Ser lie Pro Gin Gin Arg Gin 
360 365 370 

5 aag ctt aag aag gga ate ttc tgg aag ace tgg egg ggc cgc tac tac 1329 
Lys Leu Lys Lys Gly lie Phe Trp Lys Thr Trp Arg Gly Arg Tyr Tyr 
375 380 385 

ccg ctg cag gee ace ace atg ttg ate cag ccc atg gca gca gag gca 1377 
10 Pro Leu Gin Ala Thr Thr Met Leu lie Gin Pro Met Ala Ala Glu Ala 
390 395 400 

gee tec tagegtcctg gctgggcctg gtcccaggcc cacgaaagac ggtgactctt 1433 
Ala Ser 
15 405 

ggctctgccc gaggatgtgg ccgttccctg cctgggcagg ggctccaagg aggggecate 1493 
tggaaacttg tggacagaga agaagaccac gactggagaa gccccctttc tgagtgcagg 1553 

20 

ggggctgeat gcgttgcctc ctgagatcga ggctgcagga tatgetcaga ctctagaggc 1613 
gtggaccaag gggcatggag cttcactcct tgctggccag ggagttgggg actcagaggg 1673 
25 accacttggg gccagccaga ctggcctcaa tggeggaetc agtcacattg actgaegggg 1733 
accagggctt gtgtgggtcg agagcgccct catggtgctg gtgctgttgt gtgtaggtcc 1793 
cctggggaca caagcaggcg ccaatggtat ctgggcggag ctcacagagt tcttggaata 1B53 
aaagcaacct cagaacaaaa aaaaaaaaaa agggeggecg c 1894 



30 



<210> 2 
35 <211> 406 
<212> PRT 

<213> Homo sapiens 
<400> 2 

40 Met Ser Gly Ala Pro Thr Ala Gly Ala Ala Leu Met Leu Cys Ala Ala 
1 5 10. 15 

Thr Ala Val Leu Leu Ser Ala Gin Gly Gly Pro Val Gin Ser Lys Ser 
20 25 30 



45 



60 



Pro Arg Phe Ala Ser Trp Asp Glu Met Asn Val Leu Ala His Gly Leu 
35 40 45 



Leu Gin Leu Gly Gin Gly Leu Arg Glu His Ala Glu Arg Thr Arg Ser 
50 .50 55 60 

Gin Leu Ser Ala Leu Glu Arg Arg Leu Ser Ala Cys Gly Ser Ala Cys 
65 70 75 80 

55 Gin Gly Thr Glu Gly Pro Pro Thr Leu Pro Leu Ala Pro Glu Ser Arg 
85 90 95 

Val Asp Pro Glu Val Leu His Ser Leu Gin Thr Gin Leu Lys Ala Gin 
100 105 110 



Asn Ser Arg lie Gin Gin Leu Phe His Lys Val Ala Gin Gin Gin Arg 
115 120 125 
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His Leu Glu Lys Gin His Leu Arg He Gin His Leu Gin Ser Gin Phe 
130 135 140 

5 Gly Leu Leu Asp His Lys His Leu Asp His Glu Val Ala Lys Pro Ala 
145 150 155 160 

Arg Arg Lys Arg Leu Pro Glu Met Ala Gin Pro Val Asp Pro Ala His 
165 170 175 

10 

Asn Val Ser Arg Leu His Arg Leu Pro Arg Asp Cys Gin Glu Leu Phe 
180 185 190 

Gin Val Gly Glu Arg Gin Ser Gly Leu Phe Glu He Gin Pro Gin Gly 
15 195 200 205 

Ser Pro Pro Phe Leu. Val Asn Cys Lys Met Thr Ser Asp Gly Gly Trp 
210 . 215 220 

20 Thr Val He Gin Arg Arg His Asp Gly Ser Val Asp Phe Asn Arg Pro 
225 ' 230 235 240 

Trp Glu Ala Tyr Lys Ala Gly Phe Gly Asp Pro His Gly Glu Phe Trp 
245 250 255 

25 

Leu Gly Leu Glu Lys Val His Ser lie Thr Gly Asp Arg Asn Ser Arg 
260 265 270 

Leu Ala Val Gin Leu Arg Asp Trp Asp Gly Asn Ala Glu Leu Leu Gin 
30 275 280 285 

Phe Ser Val His Leu Gly Gly Glu Asp Thr Ala Tyr Ser Leu Gin Leu 
290 295 300 

35 Thr Ala Pro Val Ala Gly Gin Leu Gly Ala Thr Thr Val Pro Pro Ser 
305 310 315 320 

Gly Leu Ser Val Pro Phe Ser Thr Trp Asp Gin Asp His Asp Leu Arg 
325 - 330 335 

40, 

Arg Asp Lys Asn Cys Ala Lys Ser Leu Ser Gly Gly Trp Trp Phe Gly 
340 345 350 

Thr Cys Ser His Ser Asn Leu Asn Gly Gin Tyr Phe Arg Ser He Pro 
45 355 360 365 

'Gin Gin Arg Gin Lys Leu Lys Lys Gly He Phe Trp Lys Thr Trp Arg 
370 375 380 

50 Gly Arg Tyr Tyr Pro Leu Gin Ala Thr Thr Met Leu He Gin Pro Met 
385 390 395 400 

Ala Ala Glu Ala Ala Ser 
405 

55 

<210> 3 
<211> 1218 
<212> DNA 
60 <213> Homo sapiens 



PCTAJS99/04548 



<220> 



WO 99/45135 « PCT/US99/04548 

<221> CDS 

<222> (1) . . (1218) 

. <400> 3 ; 
5 atg age ggt get ccg acg gee ggg gca gee ctg atg etc tgc gee gee 48 
Met Ser Gly Ala Pro Thr Ala Gly Ala Ala Leu Met Leu Cys Ala Ala 
1*5- 10 15 

ace gee gtg eta ctg age get cag ggc gga ccc gtg cag tec aag teg 96 
10 Thr Ala Val Leu Leu Ser Ala Gin Gly Gly Pro Val Gin Ser Lys Ser 

20 • * 25 * ' 30 

ccg cgc ttt gcg tec tgg gac gag atg aat gtc ctg gcg cac gga etc 144 
Pro Arg Phe Ala Ser Trp Asp Glu Met Asn Val Leu Ala His Gly Leu 
15 35 40 45 

ctg cag etc ggc cag ggg ctg cgc gaa cac gcg gag cgc acc cgc agt 192 

Leu Gin Leu Gly Gin Gly Leu Arg Glu His Ala Glu Arg Thr Arg Ser , 

50 55 60 
20 • , • , 

• cag ctg age gcg ctg gag egg cgc ctg age gcg tgc ggg tec gec tgt 240 

Gin Leu Ser Ala Leu Glu Arg Arg Leu Ser Ala Cys Gly Ser Ala Cys 

65 70 75 80 

25 cag gga acc gag ggt cca ccg acc etc ccg tta gee cct gag age egg 288 
- Gin Gly Thr Glu Gly Pro Pro Thr Leu Pro Leu Ala Pro Glu Ser Arg 
85 90 95 

gtg gac cct gag gtc ctt cac age ctg cag aca caa etc aag get cag 336 
30 Val Asp Pro Glu Val Leu His Ser Leu Gin Thr Gin Leu Lys Ala Gin 
100 105 110 

aac age agg ate cag caa etc ttc cac aag gtg gee cag cag cag egg 384 
Asn Ser Arg He Gin Gin Leu Phe His Lys Val Ala Gin Gin Gin Arg 
35 115 120 125 

cac ctg gag aag cag cac ctg cga att cag cat ctg caa age cag ttt 432 
• . 'His Leu Glu Lys Gin His Leu Arg He Gin His Leu Gin Ser Gin Phe 
130 135. 140 . 



40 



60 



ggc etc ctg gac cac aag cac eta gac cat gag gtg gee aag cct gee 
Gly Leu Leu Asp His Lys His Leu Asp His Glu Val Ala Lys Pro Ala 
145 150 155 160 



480 



45 cga aga aag agg ctg ccc gag atg gee cag cca gtt gac ccg get cac 528 
Arg Arg Lys Arg Leu Pro Glu Met Ala Gin Pro Val Asp Pro Ala His 

» ' . 165 170 175 

aat gtc age cgc ctg cac egg ctg ccc agg gat tgc cag gag ctg ttc 576 
50 Asn Val Ser Arg Leu His Arg Leu Pro Arg Asp Cys Gin Glu Leu Phe 
180 185 190 

cag gtt ggg gag agg cag agt gga eta ttt gaa ate cag cct cag ggg 624 
Gin Val Gly Glu Arg Gin Ser Gly Leu Phe Glu He Gin Pro Gin Gly 
55 195 200 205 

tct ccg cca ttt ttg gtg aac tgc aag atg acc tea gat gga ggc tgg 672 
Ser Pro Pro Phe Leu Val Asn Cys Lys Met Thr Ser Asp Gly Gly Trp 
210 215 220 



aca gta att cag agg cgc cac gat ggc tea gtg gac ttc aac egg ccc 
Thr Val He Gin Arg Arg His Asp Gly Ser Val Asp Phe Asn Arg Pro 



720 
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225 



230 



235 



240 



tgg gaa gcc tac aag gcg ggg ttt ggg gat ccc cac ggc gag ttc tgg 
Trp Glu Ala Tyr Lys Ala Gly Phe Gly Asp Pro His Gly Glu Phe Trp 
5 245 250 * 255 

ctg ggt ctg. gag aag gtg cat age ate acg ggg gac cgc aac age cgc 

Leu Gly Leu Glu Lys Val His Ser He Thr Gly Asp Arg Asn Ser Arg 
260 265 270 

10 

ctg gcc gtg cag ctg egg gac tgg gat ggc aac gcc gag ttg ctg cag 

Leu Ala Val Gin Leu Arg Asp Trp Asp Gly Asn Ala Glu Leu Leu Gin 

275 280 .,. 285. 

ttc tec gtg cac ctg ggt ggc gag gac acg gcc tat age ctg cag etc 
Phe Ser Val His Leu Gly Gly Glu Asp Thr Ala Tyr Ser Leu Gin Leu 
290 295 300 

act gca ccc gtg gcc gge cag ctg ggc gcc ace ace gtc cca ccc age 
Thr Ala Pro Val Ala Gly Gin Leu Gly Ala Thr Thr Val Pro Pro Ser 
305 310 * 315 320. 

ggc etc tec gta ccc ttc tec act tgg gac cag gat cac gac etc cgc 
Gly Leu Ser Val Pro Phe Ser >Thr Trp Asp Gin Asp His Asp Leu Arg 
325 330 335 

agg gac aag aac tgc gcc aag age etc tct gga ggc tgg tgg ttt ggc 
Arg Asp Lys Asn Cys Ala Lys Ser Leu Ser Gly Gly Trp Trp Phe Gly 
340 345 350 

ace tgc age cat tec aac etc aac ggc cag tac ttc cgc tec. ate cca 
Thr Cys Ser His Ser Asn Leu Asn Gly Gin Tyr Phe Arg Ser lie. Pro 
355 360 365 

35 cag cag egg cag aag ctt aag aag gga ate ttc tgg aag ace tgg egg 
Gin Gin Arg Gin Lys Leu Lys Lys Gly He Phe Trp Lys Thr Trp Arg 
370 375 380 



15 



20 



25 



30 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



ggc cgc tac tac ccg ctg cag gcc- ace ace atg ttg ate cag ccc atg 
40 Gly Arg Tyr Tyr Pro Leu Gin Ala Thr Thr Met Leu He Gin Pro Met 
385 390 395 400 

gca gca gag gca gcc tec 
Ala Ala Glu Ala Ala Ser 
45 405 



1200 



1218 



<210> 4 
<211> 1143 
50 <212> DNA 

<213> Homo sapiens 



55 



<220> 
<221> CDS 
<222> (1) . 



(1143) 



60 



<400> 4 

gga ccc gtg cag tec aag teg ccg cgc ttt gcg tec tgg gac gag atg 48 
Gly Pro Val Gin Ser Lys Ser Pro Arg Phe Ala Ser Trp Asp Glu Met 
x 5 10 ' 15 



aat gtc ctg gcg cac gga etc ctg cag etc ggc cag ggg ctg cgc gaa 



96 
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Asn Val Leu Ala His Gly Leu Leu Gin Leu Gly Gin Gly Leu Arg Glu 

20 25 " ♦ 30 

cac gcg gag cgc acc cgc agt cag' ctg age gcg ctg gag egg cgc ctg 144 

5 ■ His Ala Glu Arg Thr Arg Ser Gin Leu Ser Ala Leu Glu Arg Arg Leu 
35 40 .45 

age gcg tgc ggg tec gee tgt cag gga acc gag ggt cca ccg acc etc 192 

Ser Ala Cys Gly Ser Ala Cys Gin Gly Thr Glu Gly Pro Pro Thr Leu 
10 50 55 60 



15 



35 



55 



ccg tta gee cct gag age egg gtg gac cct gag gtc ctt cac age ctg 
Pro Leu Ala Pro Glu Ser Arg Val Asp Pro Glu Val Leu His Ser Leu 
65 70 v 75 80 

cag aca caa etc aag get cag aac age agg ate cag caa etc ttc cac. 
Gin Thr Gin Leu Lys Ala Gin Asn Ser Arg He Gin* Gin Leu Phe His 

85 ; '. '90 - s 95 



cag cat ctg caa age cag ttt ggc etc ctg gac cac aag cac eta gac 
25 Gin His Leu Gin Ser Gin Phe Gly .Leu Leu Asp His Lys His Leu Asp 
115 - 120 125 . 



cag cca gtt gac ccg get cac aat gtc age cgc ctg cac egg ctg ccc 

Gin Pro Val Asp Pro Ala His Asn Val Ser. Arg; Leu His Arg Leu Pro 

145 150 155 160 

» t . ' ■ ■ « ; " ' 

agg gat tgc cag gag ctg ttc cag gtt ggg gag agg cag agt gga eta 

Arg Asp Cys Gin Glu Leu Phe Gin Val Gly Glu Arg Gin. Ser Gly Leu 

165 170 ' f . 175 



240 



288 



20 aag gtg gee cag cag cag egg cac ctg gag aag cag, cac ctg cga att 336 
Lys Val Ala Gin Gin Gin Arg His Leu Glu Lys Gin His Leu Arg He 
100 105 k • .110 - 



384 



cat gag gtg gee aag cct gee cga aga aag agg ctg ccc gag atg gee 432 
His Glu Val Ala Lys Pro Ala Arg Arg Lys Arg Leu Pro Glu Met Ala 
30 130 135 1 140 



480 



528 



40 ttt gaa ate cag cct. cag ggg tct ccg cca ttt ttggtg aac tgc aag , 576 
Phe Glu He Gin Pro Gin Gly Ser Pro Pro Phe Leu Val Asn Cys Lys 

180 185 , . - **' 190 ■•«-.■ 

atg acc tea gat gga ggc tgg aca gta att cag agg cgc cac gat ggc 624 
45 Met Thr Ser Asp Gly Gly Trp Thr Val Ile ; Gln Arg Arg His Asp Gly 
195 200 205 

tea gtg gac ttc aac egg ccc tgg gaa gee tac aag gcg ggg ttt ggg 672 
Ser Val Asp Phe Asn Arg Pro Trp Glu Ala Tyr Lys Ala Gly Phe Gly 
50 210 * 215' .220 



gat ccc cac ggc gag ttc tgg ctg ggt ctg gag ( aag gtg cat age ate 

Asp Pro His Gly Glu Phe Trp Leu Gly Leu Glu Lys Val His Ser He 

225 230 235 240 

acg ggg gac cgc aac age cgc ctg gee gtg;cag ctg egg gac tgg gat 

Thr Gly Asp Arg Asn Ser Arg Leu Ala ,Val Gin Leu Arg Asp Trp Asp 

245 250 ' ; 255 



720 



768 



60 ggc aac gee gag ttg ctg cag ttc tec gtg cac. ctg ggt ggc gag gac 4 816 
Gly Asn Ala Glu Leu Leu Gin Phe Ser Val His Leu Gly Gly Glu Asp 
260 265 270 



WO 99/45135 • -8- 

aca qcc tat age ctg cag etc' act gca ccc gtg. gee ggc cag .ctg. ggc. 864 
Thr III Ty r Ser Leu Gin Leu Thr Ala Pro Val Ala Gly Gin Leu Gly 
275 4 280 ,285 



acc acc acc gec cca ccc age ggc etc tec gta ccc ttc tec act tgg 912 
■ Si Thr Tnr Val Pro Pro Ser Gly Leu Ser Val Pro Phe Ser Thr Trp 

290 :. _ v ;• <; 3°° 

10 qac cag gat cac gac etc cgc'.agg. gac aag aac' tgc gee aag age etc 960 
As P Gin Lp His Lp Liu Arg Arg Asp Lys Asn Cys Ala Lys Ser Leu 
• 30S 31 °. . . . \ - ■ : 

' tct gga ggc tgg tgg ttt ggc acc tgc age cat tec aac etcaac ggc 1008 
15 Ser Sy Sy TrJ Tr? Phe Gly Thr Cys Ser His Ser Asn Leu Asn Gly 



325 : f. 



caa tac ttc cgc tec ate cca cag cag egg cag aag ctt aag aag gga 1056 
■ ' Gin Tyr Al Arg Ser lie* Pro Gin Gin Arg Gin. Lys Leu Lys Lys Gly 
20 340 ; 345. ., ... 350 

ate ttc tgg - aag ace tgg egg ggc cgc -tac tac ceg etg cag gee ace 1104 
lie Phe Tr? Lys Thr . Trp Arg Gly Arg. Tyr Tyr Pro- Leu Gin Ala Thr 

' 360 365 



355 



acc atg ttg ate cag cccatg gca- gca gag gca gec tec 
Thr Met Leu lie Gin Pro Met Ala Ala. Glu Ala Ala Ser 

370* < , > 375 / , V V 380 : . * 

30 , - ■/■> • 'V • 

<2io> 5 ■ : * \ 

<2ii> 381 * ^ *■ \. * . - 

<212> PRT - . . 

: <213> Homo sapiens * . 

35 . ! v < .\ •* ; . ' ; / * 

G^°Pro Val Gin Ser Lys Ser Pro Arg Phe Ala Ser Trp Asp Glu Met " 

i • 5 10 15 

40 Ash Val Leu,Ala His , Gly Leu Leu Gin Leu' Gly Gin Gly Leu Arg Glu 



,20 



45 



25 ; ■ . ■ 30 

His Ala Glu, Arg Thr Arg Ser Gln ^eu Ser Ala Leu Glu Arg Arg Leu 

35 ' • ' ; 40 \ . 45 

. * ■■*', , * 

ser Ala Cys Gly Ser Ala Cys Gin Gly. Thr Glu Gly Pro Pro Thr Leu 

50 . ' ' 55 - • 60 

Pro Leu Ala Pro Glu Ser Arg Val Asp Pro Glu; Val Leu His Ser Leu 
50 65 . 70 75 

Gin Thr Gin Leu Lys Ala Gin . Asn Ser Arg lie Gin Gin Leu Phe His 
.. 85 • V ' V ' 90 

55 Lys Val Ala Gin Gin Gin Arg His Leu Glu Lys Gin His Leu Arg lie 

100 ; - 105 V ■ • 

Gin His Leu Gin Ser Gin Phe. Gly Leu Leu Asp His Lys His Leu Asp 
115 120 
60 His Glu Val Ala Lys Pro Ala Arg Arg Lys. Arg Leu Pro Glu Met Ala 

■ 130 . '135' ' • • 140 
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Gin Pro Val Asp Pro Ala His Asn Val. Ser Arg Leu His Arg Leu Pro 

145 150 155 160 

5 . Arg Asp Cys Gin Glu Leu Phe Gin Val Gly Glu Arg Gin Ser Gly Leu 
165 170 .175 



10 



25 



40 



Phe Glu He Gin Pro Gin Gly Ser Pro Pro Phe Leu Val Asn Cys Lys 
180 185 r 190. 

Met Thr Ser Asp Gly Gly Trp Thr Val' He Gin Arg Arg His Asp. Gly 
195 200 ' 205 



1 Ser Val Asp Phe Asn Arg Pro Trp Glu Ala Tyr Lys Ala Gly Phe Gly 

15 210 215 220 

Asp Pro His Gly Glu Phe Trp Leu Gly Leu Glu Lys Val His Ser He 

225 230 235 , 240 

20 Thr Gly Asp Arg Asn Ser Arg Leu Ala Val Gin Leu Arg Asp Trp Asp 

245 250 . 255 



Gly Asn Ala Glu Leu Leu Gin Phe Ser Val His Leu Gly Gly Glu Asp 
260 '265 ; 270 , 

Thr Ala Tyr Ser Leu Gin Leu Thr Ala Pro Val Alady Gin Leu Gly 
275 280 285 



Ala Thr Thr Vai Pro Pro Ser Gly Leu Ser Val Pro Phe Ser Thr Trp 
30 290 295 300 

*. Asp Gin Asp His Asp Leu Arg Arg Asp Lys Asn Cys Ala Lys Ser Leu. 

305 310 315 320 

35 Ser Gly Gly Trp Trp Phe Gly Thr Cys Ser His Ser Asn Leu Asn Gly 
325 ' 330 '335 



Gin Tyr Phe Arg Ser He Pro Gin Gin Arg Gin Lys Leu. Lys Lys Gly 
,340 345 • 350 

He Phe Trp Lys Thr Trp Arg Gly Arg Tyr Tyr Pro Leu Gin Ala Thr 
c355 360 365 



Thr Met Leu He Gin Pro Met Ala Ala Glu Ala Ala Ser 
45 370 375 380 



<210> 6 
<211> 651 
50 <212> DNA 

<213> Homo sapiens ' - 

<220> 
<221> CDS 
55 <222>.(1)..(651) 

<400>V 

ctg ccc agg gat tgc cag gag ctg ttc cag gtt ggg gag agg cag agt . 48 

Leu Pro Arg Asp Cys Gin Glu Leu Phe Gin Val Gly Glu Arg Gin Ser 
60 .1 5 10 15 

gga eta ttt gaa ate cag cct cag ggg tct ccg cca ttt ttg gtg aac 96 



SUBSTITUTE SHEET (RULE 26) 



15 



35 



ttt ggg gat ccc cac ggc gag ttc tgg ctg ggt ctg. gag aag gtg cat 
Phe Gly Asp Pro His Gly Glu Phe Trp Leu Gly Leu Glu Lys Val His 
65 70 \ 75 • 80 

age ate acg ggg gac cgc aac age cgc ctg gee gtg cag ctg egg gac 
Ser lie Thr Gly Asp Arg Asn Ser Arg Leu Ala Val Gin Leu Arg Asp 
85 90 95 



20 tgg gat ggc aac gec gag ttg ctg cag ttc tec gtg cac ctg ggt ggc 
Trp Asp Gly Asn Ala Glu Leu Leu Gin Phe Ser Val His Leu Gly Gly 
100 105 U0 

gag gac acg gec tat age ctg cag etc act gca ccc gtg gee ggc cag 
25 Glu Asp Thr Ala Tyr Ser Leu Gin Leu Thr Ala Pro Val Ala Gly Gin 
115 120 125 

ctg ggc gec acc acc gtc cca ccc age ggc' etc tec gta ccc ttc tec 
Leu Gly Ala Thr Thr Val Pro Pro Ser Gly Leu Ser Val Pro Phe Ser 
30 130 135 . 140 

act tgg gac cag gat cac gac etc cgc.agg gac. aag aac tgc gec aag 
Thr Trp Asp Gin Asp His Asp Leu Arg Arg Asp Lys Asn Cys Ala Lys 
14 5 150 155 160 

age etc tct gga ggc tgg tgg ttt ggc acc tgc age cat tec aac etc 
Ser Leu Ser Gly Gly Trp. Trp Phe Gly Thr Cys Ser His Ser Asn Leu 
| 165 170 175 



aag gga ate ttc tgg aag acc tgg egg ggc cgc tac tac ccg ctg cag 
45 Lys Gly lie Phe Trp Lys Thr Trp Arg Gly Arg Tyr Tyr Pro Leu Gin 
195 200 205 

gec acc acc atg ttg ate cag ccc atg 
Ala Thr Thr Met Leu lie Gin Pro Met 
50 210 215 



<210> 7 
<211> 217 
55 <212> PRT 

<213> Homo sapiens 

<400> 7 

Leu Pro Arg Asp Cys Gin Glu Leu Phe Gin Val Gly Glu Arg Gin Ser 
60 1 / , 5 10 15 

Gly Leu Phe Glu lie Gin Pro Gin Gly Ser Pro Pro Phe Leu Val Asn 



144 



WO 99/45135 _ l0r 

Gly Leu Phe Glu lie Gin Pro Gin Gly Ser Pro Pro Phe, Leu Val Asn 
20 25 30 

tgc aag atg acc tea gat gga ggc tgg aca gta att cag agg cgc cac 
5 Cys Lys Met Thr Ser Asp Gly Gly Trp Thr Val He Gin Arg Arg His 
35 40 45 

gat ggc tea gtg gac ttc aac egg ccc tgg gaa .gec tac aag gcg ggg 192 
Asp Gly Ser Val Asp Phe Asn Arg Pro Trp Glu Ala Tyr Lys Ala Gly 
10 50 55 * 60 



PCT/US99/04548 



240 



288 



336 



3 84 



432 



480 



528 



40 aac ggc cag tac ttc cgc tec ate cca cag cag egg cag aag ctt aag 576 
Asn Gly Gin Tyr Phe Arg Ser He Pro Gin Gin Arg Gin Lys Leu Lys 
180 185 190 



624 



651 



WO 99/45135 
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20 25 30 

Cys Lys. Met Thr Ser Asp Gly Gly Trp Thr Val lie Gin Arg Arg His 
35 40 45 

5 - 

Asp Gly Ser Val Asp Phe Asn Arg Pro Trp Glu Ala Tyr Lys Ala Gly 
50 55 60 

Phe Gly Asp Pro His Gly Glu Phe Trp Leu Gly Leu Glu Lys Val His 
10 65 70 * 75 80 

Ser He Thr Gly Asp Arg Asn Ser Arg Leu Ala Val Gin Leu- Arg Asp 
85 90 95 

15 Trp Asp Gly Asn Ala Glu Leu Leu Gin Phe Ser Val His Leu Gly Gly 
100 105 HO 

Glu Asp Thr Ala Tyr Ser Leu Gin Leu Thr Ala Pro Val Ala Gly Gin 
115 120 125 

20 

Leu Gly Ala Thr Thr Val Pro Pro Ser Gly Leu Ser Val Pro Phe Ser 
130 135 * 140 

Thr Trp Asp Gin Asp His Asp Leu Arg Arg Asp Lys Asn Cys Ala Lys 
25 145 150 155 160 

Ser Leu Ser Gly Gly Trp Trp Phe Gly Thr Cys Ser His Ser Asn Leu 
165 .170 175 

30 Asn Gly Gin Tyr Phe Arg Ser He Pro Gin Gin Arg Gin Lys Leu Lys 
1B0 185 190 

Lys Gly He Phe Trp Lys Thr Trp Arg Gly Arg Tyr Tyr Pro Leu Gin 
195 200 205 

35 4 •' * ' 

Ala Thr Thr Met Leu He Gin Pro Met 
210 215 



40 <210> 8 

<211> 496 
<212> PRT 

<213> Homo sapiens ' 
45 <400> 8 

Met Trp Gin He Val Phe Phe Thr Leu Ser Cys Asp Leu Val Leu Ala 

1 '5 10 .15 

Ala Ala Tyr Asn Asn Phe Arg Lys Ser Met Asp Ser He Gly Lys Lys 
50 * 20 25 30 

Gin Tyr Gin Val Gin His Gly Ser Cys Ser Tyr Thr Phe Leu Leu Pro 
35 V * 40 45 

55 Glu Met Asp Asn Cys Arg Ser Ser Ser Ser Pro Tyr Val Ser Asn Ala 
50 55 60 

Val Gin Arg Asp Ala Pro Leu Glu Tyr Asp Asp Ser Val Gin Arg Leu 
65 " 70 75 80 

60 

. Gin Val Leu Glu Asn He Met Glu Asn Asn Thr Gin Trp Leu Met Lys 
85 90 95 
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-12- 



Leu 



Glu Asn Tyr lie Gin Asp Asn Met Lys^ys Glu Met Val OXu He 
100, . ; *os 110 

Ala Val Gin Asn Gin Thr Ala Val , Met lie Glu lie Gly 



5 Gin Gin Asn 

. 5 115 



120 



125 



Thr Asn Leu Leu Asn 

130 "5 



Gin Thr Ala Glu Gin Thr Arg Lys Leu Thr Asp 

140 



10 Glu Ala Gin Val Leu Asn Gin Thr Thr Arg Leu Glu Leu Gin Leu 



Val 
145 

* Leu Glu His Ser Leu Ser Thr Asn Lys Le^Glu Lys Gin lie Leu Asp . 

Glu lie Asn Lys Leu Gin Asp Lys Asn Ser Phe Leu Glu 



Gin Thr Ser 



180 



185 ' 



190 



20 Lys 



Lys Val Leu Ma Met Glu Asp 'Lys His lie lie Gin Leu Gin Ser 



195 



200 



lie Lys Glu Glu Lys Asp qln Leu Gin Val Leu Val Ser Lys Gin Asn 



25 



30 



210 



Ser lie 
225 



Asn Ser 



215 



He Glu Glu Leu Glu Lys Lys He Val Thr Ala Thr Val Asn 



230 



235 



Val ,Leu Gin Lys Gin Gin His Asp Leu Met Glu Thr Val Asn 



245 



250 



Asn Leu Leu 



Thr Met Met, Ser Thr Ser Asn Ser Ala Lys Asp Pro Thr 



260 



265 



270 



35 Val Ala Lys 
275 



Glu Glu Gin lie Ser Phe Arg Asp. Cys Ala Glu Val Phe 



280 



285 



Lys Ser 
290 



Gly His Thr Thr Asn Gly He Tyr Thr Leu Thr Phe Pro Asn 



295 



40 



45 



50 



55 



Glu lie Lys Ala Tyr Cys^Asp Met Glu.Ala Gly Gly Gly 



310 



315 



Ser Thr Glu 
305 

Gly Trp Thr He lie Gin Arg Arg Glu Asp Gly Ser Val Asp Phe Gin 

325 330 -. 



Arg Thr Trp Lys Glu Tyr Lys Val Gly Phe Gly Asn Pro Ser Gly Glu 
Tyr Trp Leu Gly Asn Glu Phe Val Ser Gin Leu Thr Asn Gin Gin Arg 

* o c r\ 365 



355 



360 



Tyr 



Val Leu Lys lie His Leu Lys Asp Trp Glu Gly Asri Glu Ala Tyr 



370 



375 



380 



Ser Leu Tyr Glu His Phe Tyr Leu Ser Ser Glu Glu Leu Asn Tyr Arg 



385 3 9° 
lie His Leu Lys Gly Leu Thr Gly Thr Ala Gly Lys lie Ser Ser lie 
60 405 .410 "5 

Gin Pro Gly Asn Asp Phe Ser . Thr Lys Asp Gly Asp Asn Asp Lys 



Ser 
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420 425 430 

Cys He Cys Lys Cys Ser Gin Met Leu Thr Gly Gly Trp Trp Phe Asp 
435 440 445 

5 Ala Cys Gly Pro Ser Asn Leu Asn Gly Met Tyr Tyr Pro Gin Arg Gin 
450 455 460 

Asn Thr Asn Lys Phe Asn Gly lie * Lys Trp Tyr Tyr Trp Lys Gly Ser 
10 465 470 * 475 4B0 

Gly Tyr Ser Leu Lys Ala Thr Thr Met Met lie Arg Pro Ala Asp Phe 
485 490 495 

is . • ; 

<210> 9 
<211> 496 
<212> PRT 

20 <213> Mus musculus ; 
<400> 9 

Met Trp Gin He He Phe Leu Thr Phe Gly Trp Asp Leu Val Leu Ala 

• i 5 .10 . 15 



25 



40 



55 



Ser Ala Tyr Ser Asn Phe Arg Lys Ser Val • Asp Ser Thr Gly Arg Arg 
20 .25 J 30 



Gin Tyr Gin Val Gin Asn Gly Pro Cys Ser Tyr Thr Phe Leu Leu Pro 
30 35 40 45 

Glu Thr Asp Ser Cys Arg Ser Ser Ser Ser Pro Tyr Met Ser Asn Ala 
50* 55 60 

35 Val Gin Arg Asp Ala Pro Leu Asp Tyr Asp Asp Ser Val Gin Arg Leu 
65 " 70 75 80 



Gin Val Leu Glu Asn He Leu Glu Asn Asn Thr Gin Trp Leu Met Lys 
85 90 95 

Leu Glu Asn Tyr He Gin Asp Asn Met Lys Lys Glu Met Val Glu. He 
100 105 HO ' 



Gin Gin Asn Val Val' Gin Asn Gin Thr Ala Val Met He Glu He Gly 
45 115 ■ 120 125 

Thr Ser Leu Leu Asn Gin. Thr Ala Ala Gin Thr Arg Lys Leu Thr Asp 
130 135 140 ■ 

50 Val Glu Ala Gin Val Leu Asn Gin Thr Thr Arg Leu Glu Leu Gin Leu 
145 ISO 155 160 



Leu Gin His Ser He Ser Thr Asn Lys Leu Glu Lys Gin He Leu Asp 
165 170 175 

Gin Thr Ser Glu He Asn Lys Leu Gin Asn Lys Asn Ser Phe Leu Glu 
180 185 190 



Gin Lys Val Leu Asp Met Glu Gly Lys His Ser Glu Gin Leu Gin Ser 
60 195 200 205 

Met Lys Glu Gin Lys Asp Glu Leu Gin Val Leu Val Ser Lys Gin Ser 
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210 



215 ' . 220 



Ser Val lie Asp Olu Leu Glu Lys Lye Leu Val Thr Ala Thr Val Asn 
' 225 230 2 35 240 

5 Asn Ser Leu Leu Gin Lys Gin Gin His Asp Leu Met Glu Thr Val Asn 
245 250 255 

Ser Leu Leu Thr Met Met Ser Ser Pro Asn Ser Lys Ser Ser Val Ala 
10 260 265 270 

He Arg Lys Glu Glu Gin Thr Thr Phe Arg Asp Cys Ala Glu He Phe 

285 



275 



280 



15 Lys Ser Gly Leu Thr Thr Ser Gly lie Tyr Thr Leu Thr Phe Pro Asn 
290 295 300 

Ser Thr Glu Glu He Lys Ala Tyr. Cys Asp Met, Asp Val Gly Gly Gly 
305 310 320 

20 Gly Trp Thr Val He Gin His. Arg Glu Asp Gly Ser Val Asp Phe Gin 
325 330 335 

Arg Thr Trp Lys Giu Tyr Lys Glu Gly Phe Gly Asn Pro Leu Gly Glu 
25 340 345 * 350 

* 

Tyr Trp Leu Gly Asn Glu Phe Val Ser Glh Leu Thr Gly Gin His Arg 
*. 355 360 365 

30 Tyr Val Leu Lys lie Gin Leu Lys Asp Trp Glu Gly Asn Glu Ala His 
370 375 380 

Ser Leu Tyr Asp His Phe Tyr Leu Ala Gly Glu Glu Ser Asn Tyr Arg 
385 390 395 40° 

35 lie His Leu Thr Gly Leu Thr Gly Thr . Ala Ala Lys lie Ser Ser He 
. 405 41° 415 

Ser Gin Pro Gly Ser Asp Phe Ser Thr Lys Asp Ser Asp Asn Asp Lys 
40 420 425 • 430 

Cys lie Cys Lys Cys Ser Gin Met Leu Ser Gly Gly Trp Trp Phe Asp 
.435 440 445 

45 Ala Cys Gly Pro Ser Asn Leu Asn Gly ? Gln Tyr Tyr Pro Gin Lys Gin 



450 



455 



460 



Asn Thr -Am Lys Phe Asn Gly He Lys Trp Tyr Tyr Trp Lys Gly Ser 
465 470 ... 475 480 

50 Gly Tyr Ser Leu Lys Ala Thr Thr Met Met He Arg Pro Ala Asp Phe 
485 490 495 



55 



<210> 10 
<211> 326 
60 <212> PRT 

<213> Homo sapiens 
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<400> 10 

Met Glu Leu Ser Arg Val Ala Val Ala Leu Gly Pro Thr Gly Gin Leu 
1 5 10 15 

5 Leu Leu Phe Leu Ser Phe Gin Thr Leu Ala Ala Gin Ala Ala Asp Thr 
20 25 30 

Cys Pro Glu Val Lys Val Val Gly Leu Glu Gly Ser Asp Lys Leu Ser 
35 * 40 45 

10 

lie Leu Arg Gly Cys Pro Gly Leu Pro Gly Ala Ala Gly Pro Lys Gly 
50 55 60 

Glu Ala Gly Ala Asn Gly Pro Lys Gly Glu Arg Gly Ser Pro Gly Val 
15 65 70 75 80 

Val Gly Lys Ala Gly Pro Ala Gly Pro Lys Gly Asp Arg Gly, Glu Lys 
85 90 95 

20 Gly Ala Arg Gly Glu Lys Gly Glu Pro Gly Gin Leu Gin Ser Cys Ala 
100 105 'HO 

Thr Gly Pro Arg Thr Cys Lys Glu Leu Leu Thr Arg Gly His Phe Leu 
115 120 125 

25 

Ser Gly Trp His Thr He Tyr Leu Pro Asp Cys Gin Pro Leu Thr Val 
130 135 140 

Leu Cys Asp Met Asp Thr Asp Gly Gly Gly Trp Thr Val Phe Gin Arg 
30 145 150 155 160 

Arg Ser Asp Gly Ser Val Asp Phe Tyr Arg Asp Trp Ala Ala Tyr Lys 
165 170 175 

35 Arg Gly Phe Gly Ser Gin Leu Gly Glu Phe Trp Leu Gly Asn Asp His 
180 185 190 

He His Ala Leu Thr Ala Gin Gly Thr Ser Glu Leu Arg Val Asp Leu 
195 200 205 

40 

Val Asp Phe Glu Gly Asn His Gin Phe Ala Lys Tyr Arg Ser Phe Gin 
210 215 220 

Val Ala Gly Glu Ala Glu Lys Tyr Lys Leu Val Leu Gly Gly Phe Leu 
45 225 230 235 240 

Glu Gly Asn Ala Gly Asp Ser Leu Ser Ser His Arg Asp Gin Phe Phe 
245 250 255 

50 Ser Thr Lys Asp Gin Asp Asn Asp Asn His Ser Gly Asn Cys Ala Glu 
260 265 270 

Gin Tyr His Gly Ala Trp Trp Tyr Asn Ala Cys His Ser Ser Asn Leu 
275 280 205 

55 Asn Gly Arg Tyr Leu Arg Gly Leu His Thr Ser Tyr Ala Asn Gly Val 
290 295 . 300 

Asn Trp Arg Ser Gly Arg Gly Tyr Asn Tyr Ser Tyr Gin Val Ser Glu 
60 305 310 315 320 

Met Lys Val Arg Leu Thr 
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325 



<210> 11 
5 <211> 1893 
<212> DNA 
<213> Homo sapiens 

<220> 
10 <221> CDS 

<222> (149) . . (1378) 



15 



20 



g C cgacccac gcgtccggct ccagatcttc ttctgcacca gagcaagtct aagtctgagc 60 
cggctccccc agaactccag ctgctgggtc ttgaactcct gcgttccgga gtcctagcgt 120 
tgctgcaccc aaggccaccc ccagaatc atg cgc tgc get cog aca gca ggc 172 



1 



get gee ctg gtg eta tgc gcg get aet gcg ggg ett ttg age gcg caa 
Ala Ala Leu Val Leu Cys Ala Ala Thr Ala Gly Leu Leu Ser Ala Gin 
10 IS 20 

25 aaa cac cct gca cag cca gag cca ccg cgc ttt gca tec tgg gac gag 
SJ. 5 Pro Ala Gin Pro Glu Pro Pro Arg Pbe Ala Ser Trp Asp Glu 
25 30 " 

10 ata aac ttq ctg get cac ggg ctg eta cag cte ggc cat ggg ctg cgc 
30 Me? Asn Leu Leu Ala His Leu Leu Gin Leu Gly His Gly Leu Arg 
45 5° 55 

aaa cac gtg gag cgc ace cgt ggg cag ctg ggc gcg ctg gag cgc cgc 
35 S h!s Va! Glu Arg Thr Arg Gly Gin Leu Gly Ala Leu Glu Arg Arg 
60 65 

ata act ace tat ggc aac get tgt cag ggg ccc aag gga aaa gat gca 
S Ala K C?s Sy Asn Ala Cys Gin Gly Pro Lys Gly Lys Asp Ala 
40 75 80 85 

' ?° s t» c = a s s c = S = s ; " H 

90. 95 WO 

45 act ctg cag agt ttg cag act cag etc aag get eaa aac age aag ate 
£ Leu Gin Ser Leu Gin Thr Gin Leu Lys Ala Gin Asn Ser Lys lie 
105 "° 115 

„=„ Ma tto ttc caa aag gtg gec cag cag cag aga tac eta tea aag 
Gin Gin 2 Pne Gin Lys Va! Ala Gin Gin Gin Arg Tyr Leu Ser Lys 
125 »° " 5 

caa aat ctg aga at a cag aat ctt cag age cag ata gac etc ttg gee 
Gin Asn Leu Arg He Gin Asn Leu Gin Ser Gin lie Asp Leu Leu Ala 
140 145 , 

ccc acg cac eta gac aat gga gta gac aag act teg agg gga aag agg 
Ho Thf Til Leu Asp Asn Gly Val Asp Lys Thr Ser Arg Gly Lys Arg 

160 165 



316 



364 



.412 



460 



50 



55 



60 155 

ctt ccc aag atg acc cag etc att ggc ttg act ccc aac gec acc cac 



556 



604 



652 



700 
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Leu Pro Lya Met Thr Gin Leu lie Gly Leu Thr Pro Asn Ala Thr His 

170 175 180 

tta cac agg ccg ccc egg gac tgc cag gaa etc ttc caa gaa ggg gag 

Leu His Arg Pro Pro Arg Asp Cys Gin Glu Leu Phe Gin Glu Gly Glu 

185 190 195 ; 200 



748 



egg cac agt gga ctt ttc cag ate cag cct ctg ggg tct cca cca ttt 

,Arg His Ser Gly Leu Phe Gin lie Gin Pro Leu Gly Ser Pro Pro Phe 
10 205 210 215 

ttg gtc aac tgt gag atg act tea gat gga ggc tgg aca gtg att cag 

• Leu Val Asn Cys Glu Met Thr Ser Asp Gly Gly Trp Thr Val lie Gin 
220 225 230 



15 



aga cgc ctg aac ggc tct gtg gac ttc aac cag tec tgg gaa gee tac 
Arg Arg Leu Asn , Gly Ser Val Asp Phe Asn Gin Ser Trp Glu Ala Tyr 
235 240 245 



796 



844 



892 



20 aag gat ggc ttc gga gat ccc caa ggc gag ttc tgg ctg ggc ctg gaa 
Lys Asp Gly Phe Gly Asp Pro Gin Gly Glu Phe Trp Leu Gly Leu Glu 
250 255 260 



940 



aag atg cac age ate aca ggg aac cga gga age caa ttg get gtg cag 
Lys Met His Ser lie Thr Gly Asn Arg Gly Ser Gin Leu Ala Val Gin 
265 270 275 280 

etc cag gac tgg gat ggc aat gee aaa ttg etc caa ttt ccc ate cat 
Leu Gin Asp Trp Asp Gly Asn Ala Lys Leu Leu Gin Phe Pro lie His 
285 290 295 

ttg ggg ggt gag gac aca gee tac age ctg cag etc act gag ccc acg 
Leu Gly Gly Glu Asp Thr Ala Tyr Ser Leu Gin Leu Thr Glu Pro Thr 
300 305 310 

gee aat gag ctg ggt gec ace aat gtt tec ccc aat ggc ctt tec ctg 
Ala Asn Glu Xeu Gly Ala Thr Asn Val Ser Pro Asn Gly Leu Ser Leu 
315 4 - 320 325 

ccc ttc tct act tgg gac caa gac cat gac etc cgt ggg gac ctt aac 
Pro Phe Ser Thr Trp Asp Gin Asp His Asp Leu Arg Gly Asp Leu Asn 
330 . 335 , 340 

1 tgt gee aag age etc tct ggt ggc tgg tgg ttt ggt ace tgt age cat 
^Cys Ala Lys Ser Leu Ser Gly Gly Trp Trp Phe Gly Thr Cys Ser His 
345 350 355 360 

tec aat etc aat gga caa tac ttc cac tct ate cca egg caa egg cag 
Ser Asn Leu Asn Gly Gin Tyr Phe His Ser He Pro Arg Gin Arg Gin 
365 370 375 

gag cgt aaa aag ggt ate ttc tgg aaa aca tgg aag ggc cgc tac tat 
Glu Arg Lys Lys Gly He Phe Trp Lys Thr Trp Lys Gly Arg Tyr Tyr 
380 385 390 

cct ctg cag get ace ace ctg ctg ate cag ccc atg gag get aca gca 
Pro Leu Gin Ala Thr Thr Leu Leu He Gin Pro Met Glu Ala Thr Ala 
395 400 405 



60 gee tct tagcctcctc actggagect ggttccaggc ctaagaagac agtgactttg 
- • Ala Ser v 
410 



25 



30 



35 



40 



45 



50 



55 



988 



1036 



1084 



1132 



1180 



1228 



1276 



1324 



1372 



1428 
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- 


gttgtggccc 


tgagatttgg ccattctctg ctgggggcag gagctctaag 


tagggctatc 


1488 


5 


tgcgtcttgt 


ggacaaagaa gaagcccgta actggagaga ctggaggacc 


ccttttccgt 


1548 




gttggggtct 


gcaagcattg ttgtctgaaa.cagtcagagc aacaggaaac 


aaatggccca 


1608 




gatccagaaa 


acatgggctc gaggggcact gaatatcact tctcgcctac 


cagagaagtt 


1668 


10 


ggggatgcag 


agggaccact acagtccaac tagctgggcc . cttaatggcg 


gactcagtca 


1728 




tattgactga 


ctggagacag ggtgc'cagga gccctggata cactcatggt 


gctgttgtag 


1788 


15 


gtgctgtgga 


tgcacaggtg ctaactgtgg ttcccaggca cagctcacag 


cattcttaca 


1848* 


ataaaaacaa 


cctcagaaca aaaaaaaaaa aaaaagggcg gccgc , 




1893 



<210> 12 ' 
20 <211> 410 . ■ -\ ■ 

<212> PRT > ■ 

<213> Homo sapiens „ - , 

<400>. 12 ; • . 

25 Met Arg Cys Ala Pro Thr Ala Gly Ala Ala Leu Val Leu Cys Ala Ala 
1 5 . • 10 15 

Thr Ala Gly Leu Leu Ser Ala Gin Gly Arg Pro Ala Gin Pro Glu Pro 
20 25 * 30 



30 



Pro Arg Phe Ala Ser Trp Asp Glu Met Asn Leu Leu Ala His Gly Leu 
35 40 45 • 



Leu Gin Leu Gly His Gly Leu Arg Glu His Val Glu Arg Thr Arg Gly 
35 50 55 60 

Gin Leu Gly Ala Leu Glu Arg Arg Met Ala Ala Cys Gly . Asn Ala Cys 
€S 70 / 75 80. 

40 Gin Gly Pro Lys Gly Lys Asp Ala Pro Phe Lys Asp Ser Glu Asp Arg 
85 90 ^95 

Val Pro Glu Gly Gin Thr Pro Glu Thr Leu Gin Ser lieu Gin Thr Gin 
100 - 105 11° 

45 ' ^ : : 

Leu Lys Ala Gin Asn Ser Lys lie Gin Gin Leu Phe Gin Lys Val Ala 
1X5 120 125 

Gin Gin Gin Arg Tyr Leu Ser Lys Gin Asn Leu Arg He Gin Asn Leu 
50 130 ^ 135 140 

Gin Ser Gin He Asp Leu Leu Ala Pro Thr His Leu Asp Asn Gly Val 
145 150 155 160 

55 Asp Lys Thr Ser Arg Gly Lys Arg Leu Pro Lys Met Thr Gin Leu lie 
165 170 175 



60 



Gly Leu Thr 1 Pro Asn Ala Thr His Leu His Arg Pro Pro Arg Asp Cys 
180 185 190 

Gin Glu Leu Phe Gin Glu Gly Glu Arg His Ser Gly Leu Phe Gin lie 

195 200 * 205 
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Gin Pro Leu Gly Ser Pro Pro Phe Leu Val Asn Cys Glu Met Thr Ser 
210 215 220 

5 Asp Gly Gly Trp Thr Val He Gin Arg Arg Leu Asn Gly Ser Val Asp 
225 230 235 240 

Phe Asn Gin Ser Trp Glu Ala Tyr Lys Asp Gly Phe Gly Asp Pro Gin 
245 250 255 



10 



25 



40 



55 



Gly Glu Phe Trp Leu Gly Leu Glu Lys. Met His Ser lie Thr Gly Asn 
260 265 270 . 



Arg Gly Ser Gin. Leu Ala Val Gin Leu Gin Asp Trp Asp Gly Asn Ala 
15 275 280 .285 

Lys Leu Leu Gin Phe Pro lie, His Leu Gly Gly Glu Asp Thr Ala Tyr 
290 295; , 300 

20 Ser Leu Gin Leu Thr Glu Pro Thr Ala Asn Glu Leu Gly Ala Thr Asn 
305 . 310 315 320 



Val Ser Pro Asn Gly Leu Ser Leu Pro Phe Ser Thr Trp Asp Gin Asp 
325 330 335 

His Asp Leu Arg Gly Asp Leu Asn Cys Ala Lys Ser. v Leu Ser Gly Gly 
340 345 ' 350; 



Trp Trp Phe Gly Thr Cys Ser His Ser Asn Leu Asn Gly Gin Tyr Phe 
30 355 360 365 

His Ser He Pro Arg Gin Arg Gin Glu Arg Lys Lys Gly He Phe Trp 
370 375 ' 380 

35 Lys Thr Trp Lys Gly Arg Tyr Tyr Pro Leu Gin Ala Thr- Thr Leu Leu 

385 390 395 400 



lie Gin Pro Met Glu Ala Thr Ala Ala Ser 
405 410 



<210> 13 ' \ 
<211> 1230 ' 
<212> DKA 
45 <213> Mus musculus 

<220> 

<221> CDS 

<222> (1) . . (1230) 

50 

<400> 13 * 

atg cgc tgc get ccg aca gca ggc get gee ctg gtg eta tgc gcg get 48 
Met Arg Cys Ala Pro Thr Ala Gly Ala Ala Leu. Val Leu Cys Ala Ala 

1 5 10 k * 15 

act gcg ggg ctt ttg age gcg caa ggg cgc cct gca cag cca gag cca ,96 
Thr Ala Gly Leu Leu Ser Ala Gin Gly Arg Pro Ala Gin Pro Glu Pro 
20 '25 30 

60 ccg cgc ttt gca tec tgg gac gag atg aac ttg ctg get cac ggg ctg 144 
Pro Arg Phe Ala Ser Trp Asp Glu Met Asn Leu Leu Ala His Gly Leu 
35 40 45 
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cta cag etc ggc cat ggg ctg cgc gaa cac gtg gag cgc acc cgt ggg 192 
Leu Gin Leu Gly His Gly Leu Arg Glu His Val Glu Arg Thr Arg Gly 
50 55 60. 

.5 • - v. / 

cag ctg ggc gcg ctg gag cgc cgc atg get gec tgt ggt aac get tgt 240 
Gin Leu Gly Ala Leu Glu Arg Arg Met Ala Ala Cys Gly Asn Ala Cys 
65 70 ■ 75 80 

10 cag ggg ccc aag gga aaa gat gca ccc ttc aaa gac tec gag gat aga 288 
Gin Gly Pro Lys Gly Lys Asp Ala. Pro Phe Lys Asp Ser Glu Asp Arg 

* 85 " ' 90 95 . 



gtc cct gaa ggc cag act cct gag act ctg cag agt ttg cag act cag 336 
15 Val Pro Glu Gly Gin Thr Pro Glu Thr Leu Gin Ser Leu Gin' Thr Gin 
' ■ * :"■ 100 * 105 * HO 

etc aag get caa aac age aag ate cag caa ttg ttc cag aag gtg gee 384 
Leu Lys Ala Gin Asn Ser Lys He Gin Gin Leu Phe Gin Lys Val Ala 
20 115 , r 120 . 125 

cag cag cag aga tac eta tea aag cag aat ctg aga ata cag aat ctt 432 

Gin Gin Gin Arg Tyr Leu Ser Lys Gin Asn Leu Arg lie Gin Asn Leu 
130 135 140 

25 - , 

cag age cag ata gac etc, ttg gee ccc acg cac eta gac aat gga gta 480 

Gin Ser Gin He Asp Leu Leu Ala Pro thr His Leu Asp Asn Gly Val 
145 150 155 160 

30 gac aag act teg agg gga aag agg ctt ccc aag atg acc cag etc att 528 
Asp Lys Thr Ser Arg Gly Lys Arg Leu Pro Lys Met Thr Gin Leu He 
165.: 170 175 

ggc ttg act ccc aac gee acc cac tta cac. agg ccg ccc egg gac tgc 576 
35 Gly Leu Thr Pro Asn Ala Thr His Leu His Arg Pro Pro Arg Asp Cys 
180 185 190 

cag gaa etc ttc caa gaa ggg gag egg cac agt gga ctt ttc cag ate 624 
Gin Glu Leu Phe Gin Glu Gly Glu Arg His Ser Gly Leu Phe Gin He 
40 195 200 205 . 

cag cct ctg ggg tct cca cca ttt ttg gtc aac tgt gag atg act tea 672 
Gin Pro Leu Gly Ser Pro Pro Phe Leu Val Asn Cys Glu Met Thr Ser. 
210 215 220 

45 

gat gga ggc tgg aca gtg att cag aga cgc ctg aac ggc tct gtg gac 720 
Asp Gly Gly Trp Thr Val He Gin Arg Arg teu Asn Gly Ser Val Asp 
225 230 235 , 240 

50 ttc aac cag tec tgg gaa gee tac aag gat ggc ttc gga gat ccc caa 768 
Phe Asn Gin Ser Trp Glu Ala Tyr Lys Asp Gly Phe Gly Asp Pro Gin 

245,: . 250 255 

? ggc gag ttc tgg ctg ggc ctg gaa aag atg cac age ate aca ggg aac 816 
55 Gly Glu Phe Trp Leu Gly Leu Glu Lys Met His Ser He Thr Gly Asn 
260 265 270 



cga gga age caa ttg get gtg cag etc cag gac tgg gat ggc aat gee 
. Arg Gly Ser Gin Leu Ala Val Gin Leu Gin Asp Trp Asp Gly Asn Ala 
60 275 280 285 



864 



aaa ttg etc caa ttt ccc ate cat ttg ggg ggt gag gac aca gee tac 912 



WO 99/45135 



■21 - 



PCT/US99/04548 



Lys Leu Leu Gin Phe Pro He His Leu Gly Gly Glu Asp Thr Ala Tyr 
290 295 300 



10 



15 



age ctg cag etc act gag ccc acg gec aat gag ctg ggt gec acc aat 

Ser Leu Gin Leu Thr Glu Pro Thr Ala Asn Glu Leu Gly Ala Thr Asn 

305 310 . 315 320 

gtt tec ccc aat ggc ctt tec ctg ccc ttc tct act tgg gac caa gac 

Val Ser Pro Asn Gly Leu Ser Leu Pro Phe Ser Thr Trp Asp Gin Asp 

325 330 335 

cat gac etc cgt ggg gac ctt aac tgt gee aag age etc tct ggt ggc 

His Asp Leu Arg Gly Asp Leu Asn Cys Ala Lys Ser Leu Ser Gly Gly 

340 345 350 

tgg tgg ttt ggt acc tgt age cat tec aat etc aat gga caa tac ttc 

Trp Trp Phe Gly Thr Cys Ser His Ser Asn Leu Asn Gly Gin Tyr Phe 

355 360 365 



960 



1008 



1056 



1104 



20 cac tct ate cca egg caa egg cag gag cgt aaa aag ggt ate ttc tgg 
His Ser lie Pro Arg Gin Arg Gin Glu Arg Lys Lys Gly He Phe Trp 
370' " 375 380 



1152 



aaa aca tgg aag ggc cgc tac tat cct ctg cag get acc' acc ctg ctg 
25 * Lys Thr Trp Lys Gly Arg Tyr. Tyr Pro Leu Gin Ala Thr Thr Leu Leu 
385 L 390 * 395 400 



30 



ate cag ccc atg gag. get aca gca gec tct 
He Gin Pro Met Glu Ala Thr Ala' Ala Ser 
405 410 



1200 



1230 



<210> 14 

<211> 1161 
35 <212> DNA 
y . <213> Mus musculus 

<220> 
<221> CDS 
40 <222> (1) . . (1161) 



45 



50 



<400> 14 

caa ggg cgc cct gca cag cca gag cca ccg cgc ttt gca tec tgg gac 

Gin Gly Arg Pro Ala Gin Pro Glu Pro Pro Arg Phe Ala Ser Trp Asp 
1 ' s 10 15 

gag atg aac ttg ctg get cac ggg ctg eta cag etc ggc cat ggg ctg 
Glu Met Asn Leu Leu Ala His Gly Leu Leu Gin Leu Gly His Gly Leu 
20 25 30 

cgc gaa cac gtg gag cgc acc cgt ggg cag ctg ggc gcg ctg gag cgc 
Arg Glu His Val Glu Arg Thr Arg Gly Gin Leu Gly Ala Leu Glu Arg 
35 40/ 45 

55 cgc atg get gee tgt ggt aac get tgt cag ggg ccc aag gga aaa gat 
. Arg Met Ala Ala Cys Gly Asn Ala Cys Gin Gly Pro Lys Gly Lys Asp 
50 .55 60 

gca ccc ttc aaa gac tec gag gat aga gtc cct gaa ggc cag act cct 
60 Ala Pro Phe Lys Asp Ser Glu Asp Arg Val Pro Glu Gly Gin Thr Pro 
65 > 70 75 80 



46 



96 



144 



192 



240 
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40 



60 



gag act ctg cag agt ttg cag act cag etc aag get caa aac age aag 
Glu Thr Leu Gin Ser Leu Gin Thr Gin Leu Lys Ala Gin Asn Ser Lys 
85 90 95 

ate cag caa ttg ttc cag aag gtg gee cag cag cag aga tac eta tea 
lie Gin Gin Leu Phe Gin Lys Val Ala Gin Gin Gin Arg Tyr Leu Ser 
100 105 * 110 



288 



336 



aag cag aat ctg aga ata cag aat. ctt cag age cag ata gac etc ttg 384 
10 Lys Gin Asn Leu Arg lie Gin Asn Leu Gin Ser Gin He Asp Leu Leu 
115 120 125 

gee ccc acg cac eta gac aat gga gta gac aag act teg agg gga aag 432 
Ala Pro Thr His Leu Asp Asn Gly Val Asp Lys Thr Ser Arg Gly Lys 
15 130 135 140 

agg ctt ccc aag atg ace cag etc att ggc ttg act ccc aac gee ace 480 
Arg Leu Pro Lys Met Thr Gin Leu He Gly Leu Thr Pro Asn Ala Thr 
145 150 155 160 

20 

cac tta cac agg ccg ccc egg gac tgc cag gaa etc ttc caa gaa ggg 528 
His Leu His Arg Pro Pro Arg Asp Cys Gin Glu Leu Phe Gin Glu Gly 
165 170 175 

25 gag egg cac agt gga ctt ttc cag ate cag cct ctg ggg tct cca cca 576 
Glu Arg His Ser Gly Leu Phe Gin He Gin Pro Leu Gly Ser Pro Pro 
180 185 190 

ttt ttg gtc aac tgt gag atg act tea gat gga ggc tgg aca gtg att 624 
30 Phe Leu Val Asn Cys Glu Met Thr Ser Asp Gly Gly Trp Thr Val lie 
195 , 200 205 

cag aga cgc ctg aac ggc tct gtg gac ttc aac cag tec .tgg gaa gee 672 
Gin Arg Arg Leu Asn Gly Ser Val Asp Phe Asn Glh Ser Trp Glu Ala 
35. 210 215 220 

- tac aag gat ggc ttc gga gat ccc caa ggc gag ttc tgg ctg ggc ctg 720 
Tyr Lys Asp Gly Phe Gly Asp Pro Gin Gly Glu Phe Trp Leu Gly Leu 
225 230 235 240 



gaa aag atg cac age ate aca ggg aac cga gga age caa ttg get gtg 768 
Glu Lys Met His Ser He Thr Gly Asn Arg Gly Ser Gin Leu Ala Val 
245 250 255 



864 



45 cag etc cag gac tgg gat ggc aat gee aaa ttg etc, caa ttt ccc ate 816 
Gin Leu Gin Asp Trp Asp Gly Asn Ala Lys Leu Leu Gin Phe Pro He 
260 265 t 270 

cat ttg ggg ggt gag gac aca gee tac age ctg cag etc act gag ccc 
50 His Leu Gly Gly Glu Asp Thr Ala Tyr Ser Leu Gin Leu Thr Glu Pro 
275 280 285 

acg gee aat gag ctg ggt gee ace aat gtt tec ccc aat ggc ctt tec 912 
Thr Ala Asn Glu Leu Gly Ala Thr Asn Val Ser Pro Asn Gly Leu Ser 
55 290 295 300 



ctg ccc ttc tct act tgg gac caa gac cat gac v ctc cgt ggg gac ctt 

Leu Pro Phe Ser Thr Trp Asp Gin Asp His Asp Leu Arg Gly Asp Leu 

305 310 315 320 

-aac tgt gec aag age etc tct ggt ggc tgg tgg ttt ggt ace tgt age 

Asn Cys Ala Lys Ser Leu Ser Gly Gly Trp Trp, Phe Gly Thr Cys Ser ' 



960 



1008 
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10 



325 330 335 

cat tec aat etc aac gga caa'tac ttc cac tct ate cca egg caa egg 1056 
His Ser Asn Leu Asn Gly Gin Tyr Phe His Ser He Pro Arg Gin Arg 
340 345 350 

cag gag cgt aaa aag ggt ate ttc tgg aaa aca tgg aag ggc cgc tac 1104 
Gin Glu Arg Lys Lys Gly He Phe Trp Lys Thr Trp Lys Gly Arg Tyr 
355 360 365 

tat cct ctg cag get ace ace ctg ctg ate cag ccc atg gag get aca 1152 
Tyr Pro Leu Gin Ala Thr Thr Leu Leu He Gin Pro Met Glu Ala Thr 
370 375 380 

15 gca gee tct 1161 
Ala Ala Ser 
385 



20 <210> 15 * 
<211> 387 
<212> PRT 

<213> Mus mus cuius 
25 <400> 15 

Gin Gly Arg Pro Ala Gin Pro Glu Pro Pro Arg Phe Ala Ser Trp Asp 
1 5 10 15 

Glu Met Asn Leu Leu Ala His Gly Leu Leu Gin Leu Gly His Gly Leu 
30 20 '25 30 

Arg Glu His Val Glu Arg Thr Arg Gly Gin Leu Gly -Ala Leu Glu Arg 
35 40* 45 

35 Arg Met Ala Ala Cys Gly Asn Ala Cys Gin Gly Pro Lys Gly Lys Asp 
50 55 60 

Ala Pro Phe Lys Asp Ser Glu Asp Arg Val ' Pro Glu Gly Gin Thr Pro 
65 ' 70 75 80 

Glu Thr Leu Gin Ser Leu Gin Thr Gin Leu Lys Ala Gin Asn Ser Lys 
85 90 95 



40 



lie Gin Gin Leu Phe Gin Lys Val Ala Gin Gin Gin Arg Tyr Leu Ser 
45 100 105 HO 

Lys Gin Asn Leu Arg He Gin Asn Leu Gin Ser Gin He Asp Leu Leu 
115 120 125 

50 Ala Pro Thr His Leu Asp Asn Gly Val Asp Lys Thr Ser Arg. Gly Lys 
130 135 • 140 . 



55 



Arg Leu Pro Lys Met Thr Gin Leu lie Gly Leu Thr Pro Asn Ala Thr 
145 150 155 160 

His Leu His Arg Pro Pro Arg Asp Cys Gin Glu Leu Phe Gin Glu Gly 
165 170 175 



Glu Arg His Ser Gly Leu Phe Gin lie Gin Pro Leu Gly Ser Pro Pro 
60 180 185^ * 190 

Phe Leu Val, Asn Cys Glu Met Thr Ser Asp Gly Gly Trp Thr Val He 
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195 ' \ 200 .: 205 

Gin Arg Arg Leu Asn Gly Ser Val Asp Phe Asn Gln^ Ser Trp Glu Ala 

210 ■ 215 220- * 

5 r 

Tyr Lys Asp Gly Phe Gly Asp Pro Gin Gly Glu Phe Trp Leu Gly Leu 
225 - 230 235 240 

Glu Lys Met His Ser lie" Thr Gly Asn Arg Gly Ser Gin Leu Ala Val 
10 245 250 2S5 

' ■ Y 

Gin Leu Gin Asp Trp Asp Gly Asn Ala Lys Leu Leu Gin Phe Pro Jle 

. ..... .,.„.., ... 260 , , .265 ... "... 270, . ., ,. 

15 His Leu Gly Gly Glu Asp Thr Ala ' Tyr Ser Leu Gin Leu Thr Glu Pro 
275' / 280 285 

Thr Ala Asn Glu Leu Gly Ala Thr Asn Val Ser Pro Asn Gly Leu Ser 
290 295 .300 

20 . « ' 

Leu Pro Phe Ser Thr Trp Asp Gin Asp His Asp Leu Arg Gly Asp Leu 

305 310 \ .315 * ...320 

Asn Cys Ala Lys Ser Leu Ser Gly Gly Trp Trp Phe Gly Thr Cys Ser 
25 325 > 330 335 

His Ser Asn Leu Asn Gly Gin Tyr Phe His Ser lie Pro Arg Gin Arg 
340 345 350 

30 Gin Glu Arg Lys Lys Gly He Phe Trp Lys Thr Trp Lys Gly Arg Tyr 
355 360 365 

Tyr Pro Leu Gin Ala Thr Thr Leu Leu He Gin Pro Met Glu Ala Thr 
370 375 380 

35 <\ ' .'• • ' 

Ala Ala Ser 

385' * * ' ; . ./ 



40 <210> 16 

<211> 651. ' : 

<212> DNA 

*<213> Mus musculus 

. 45 <220> 

<221> CDS 

<222> (1) . . (651) 

<400> 16 

50 ccg ccc egg gac tgc cag gaa etc ttc caa gaa ggg gag egg cac agt 48 

Pro Pro Arg Asp Cys Gin Glu Leu Phe Gin Glu Gly Glu Arg His Ser 
1 5 K 10 15 

gga ctt ttc cag ate cag cct ctg ggg tct cca cca ttt ttg gtc aac 96 
55 Gly Leu Phe Gin He Gin Pro Leu Gly Ser Pro Pro Phe Leu Val Asn 
20 25 30 

tgt gag atg act tea gat gga ggc tgg aca gtg att cag aga cgc ctg 144 
Cys Glu Met Thr Ser Asp Gly. Gly Trp Thr Val lie Gin Arg Arg Leu 
60 ^ 35 40 ' 7 45 

aac ggc tct gtg gac ttc aac cag tec tgg gaa gee tac aag gat ggc .192 
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•25- 

Asn Gly Ser Val Asp Phe Asn Gin Ser Trp Glu Ala Tyr Lys Asp Gly 
50 55 .60 

ttc gga gat ccc caa ggc gag ttc tgg ctg ggc ctg gaa aag atg cac 240 
Phe Gly Asp Pro Gin Gly Glu Phe Trp Leu Gly Leu Glu Lys Met His 
65 10 75 80 



288 



336 



384 



age ate aca ggg aac cga gga age caa ttg get gtg cag etc cag gac 
Ser He Thr Gly Asn Arg Gly Ser Gin Leu Ala Val Gin Leu Gin Asp 
10 85 90 95 

tgg gat ggc aat gec aaa ttg etc caa ttt ccc ate cat ttg ggg ggt 

Trp Asp Gly Asn Ala Lys Leu Leu Gin Phe Pro lie His Leu Gly Gly 

100 105 110 

15 

gag gac aca gee tac age ctg cag etc act gag ccc acg gee aat gag 

Glu Asp Thr Ala Tyr Ser Leu Gin Leu Thr Glu Pro Thr Ala Asn Glu 

115 120 125 

20 ctg ggt gee acc aat gtt tec ccc aat ggc ctt tec ctg ccc ttc tct 432 
Leu Gly Ala Thr Asn Val Ser Pro Asn Gly Leu Ser Leu Pro Phe Ser 
130 135 140 

act tgg gac caa gac cat gac etc cgt ggg gac ctt aac tgt gee aag 480 
25 Thr Trp Asp Gin Asp His Asp Leu Arg Gly Asp Leu Asn Cys Ala Lys 
145 150 155 160 

t, 

age etc tct ggt ggc tgg tgg ttt ggt acc tgt age cat tec aat etc- 528 
Ser Leu Ser Gly Gly Trp Trp Phe Gly Thr Cys Ser His Ser Asn Leu 
30 165 ; 170 175 

aat gga caa tac ttc cac tct ate cca egg caa egg cag gag cgt aaa 576 

Asn Gly Gin Tyr Phe His Ser He Pro Arg Gin Arg Gin Glu Arg Lys 

180 185 190 

35 

aag ggt ate ttc tgg aaa aca tgg aag ggc cgc tac tat cct ctg cag 624 

Lys Gly He Phe Trp Lys Thr Trp Lys Gly Arg Tyr Tyr Pro Leu Gin 

195 200 205 

40 get acc acc ctg ctg. ate cag ccc atg 651 
Ala Thr Thr Leu Leu He Gin Pro Met 
210 215 



45 <210> 17 

<211> 217 * 
<212> PRT 

<213> Mus musculus 
50 <400> 17 

Pro Pro Arg Asp Cys Gin Glu Leu Phe Gin Glu Gly Glu Arg His Ser 
1 . ' 5 10 15 

Gly Leu Phe Gin He Gin Pro Leu Gly Ser Pro Pro Phe Leu Val Asn 
55 20 25 30 

Cys Glu Met Thr Ser Asp Gly Gly Trp Thr Val He Gin Arg Arg Leu 
35 40 45 

60 Asn Gly Ser Val Asp Phe Asn Gin Ser Trp Glu Ala Tyr Lys Asp Gly 

50 * 55 60. 
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Phe Gly Asp Pro Gin Gly GXu Phe Trp Leu Gly Leu Glu Lys Met His 
65 70 75 . 80 

Ser lie Thr Gly Asn Arg Gly Ser Gin Leu Ala Val Gin Leu Gin Asp 
5 85 90 95 

Trp Asp Gly Asn Ala Lys Leu Leu Gin Phe Pro lie His Leu Gly Gly 
100 105 110 

10 Glu Asp Thr Ala Tyr Ser Leu Gin Leu Thr Glu Pro Thr Ala Asn Glu 
115 120 V 125 

Leu Gly Ala Thr Asn Val Ser Pro Asn Gly Leu Ser Leu Pro Phe Ser 
130 135 140 

15 Thr Trp Asp Gin Asp His Asp Leu Arg Gly Asp Leu Asn Cys Ala Lys 
145 150 I 55 160 

Ser Leu Ser Gly Gly Trp Trp Phe Gly Thr Cys Ser His Ser Asn Leu 
20 165 170 175 

Asn Gly Gin Tyr Phe His Ser lie Pro Arg Gin Arg Gin Glu Arg Lys 
180 190 

25 Lys Gly He Phe Trp Lys Thr Trp Lys Gly Arg Tyr Tyr Pro Leu Gin 
195 200 . 205 

Ala Thr Thr Leu Leu lie Gin Pro Met 
210 215 

30 ■ 

<210> 18 
<211> 20 
<212> DNA 
35 <213> synthetic construct 



<400> 18 

aattaaccct cactaaaggg 

40 

<210> 19 
<211> 32 
<212> DNA 

<213> synthetic construct 

45 

<400> 19 

acgcgtcgac taatacgact cactataggg eg 



50 <210> 20 
<211> 20 
. <212> DNA 

<213> synthetic construct 



20 



32 



55 <400> 20 

aattaaccct cactaaaggg 



<210> 21 
60 .<211> 62 
<212> DNA 

<213> synthetic construct 
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<400> 21 

tacgcgtcga cctaagcgta gtctgggacg tcgtatgggt aagaggctgc tctagcctcc 60 
at 62 



<210> 22 

<211> 8 

10 <212> PRT 

<213> synthetic construct 



<400> 22 

Asp Tyr Lys Asp Asp Asp Asp Lys 
15 1 .5 



<210> 23 

<2U> 35 

20 <212> DNA 

<213> synthetic construct 



25 



35 



40 



45 



.50 



<400> 23 

tttttcaatt gaccgccatg agcggtgctc cgacg 35 



<210> 24 

<2li> 59 

<212> DNA 

30 <213> synthetic construct 

<400> 24 

tttttgtcga cttatcactt gtcgtcgtcg tccttgtagt cggaggctgc ctctgctgc 59 



<210> 25 
<211> 9 
<212> PRT 

<213> synthetic construct 
<400> 25 

Tyr ' Pro Tyr Asp Val Pro Asp Tyr Ala 

• ' 1 5 



<210> 26 

<211> 22 

<212> PRT 

<213> synthetic construct 



<400> 26 

Lys Thr Trp Arg Gly Arg Tyr Tyr Pro Leu Gin Ala Thr Thr Met Leu 
15 10 15 

55 He Gin Pro Met Ala Ala 
20 



<210> 27 
60 <211> 16 
<212> PRT 

<213> synthetic construct 
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<400> 27 

Pro Val Gin Ser Lys Ser Pro Arg Phe Ala Ser Trp Asp Glu Met Asn 
1 5 10 15 

5 
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